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NUCLEIC ACIDS AND PROTEINS 
RELATED TO ALZHEIMER'S DISEASE, 
AND USES THEREFOR 

Field of the Invention 

The present invention relates generally to the Held of neurological and 
physiological dysfunctions associated with Alzheimer's Disease. More particularly, 
the invention is concerned with the identification, isolation and cloning of genes 
which are associated with Alzheimer's Disease, as well as their corresponding 
transcripts and protein products. The present invention also relates to methods for 
detecting and diagnosing carriers of normal and mutant alleles of these genes, to 
methods for detecting and diagnosing Alzheimer's Disease, to methods of identifying 
15 other genes and proteins related to, or interacting with, the genes and proteins of the 
invention, to methods of screening for potential therapeutics for Alzheimer's Disease, 
to methods of treatment for Alzheimer's Disease, and to cell lines and animal models' 

useful in screening for and evaluating potentially useful therapies for Alzheimer's 
Disease. 

Backeroun d of the Invention 
Alzheimer's Disease (AD) is a degenerative disorder of the human central 
nervous system characterized by progressive memory impairment and cognitive and 
intellectual decline during mid to late adult life (Katzman, 1986). The disease is 
accompanied by a constellation of neuro-pathologic features principal amongst which 
are the presence of extracellular amyloid or senile plaques, and neurofibrillary tangles 
in neurons. The etiology of this disease is complex, although in some families it 
appears to be inherited as an autosomal dominant trait. Linkage studies have 
identified three genes associated with the developmem of AD: (i-amyloid precursor 
protein (APP) (Chartier-Hariin et al.. 1991; Goate et al., 1991; Murrell et al. 1991- 
Karlinsky et al., 1992; Mullan et al., 1992), presenilin-1 (PS-1) (Sherrington, 1995^ 
and presenilin-2 (PS-2) (Rogaev, 1 995. and Levy-Lahad. 1 995). 

The presenilins are multi-spanning membrane proteins which were described 
in substantial detail in PCT Publication WO96/34099. the entire disclosure of which 
.s incorporated herein by reference. Although the functions of the presenilins are 
unknown, a number of autosomal dominant presenilin mutations have been identified 
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which are strongly associated with the development of early-onset, aggressive. 
Familial Alzheimer's Disease (FAD). 

The present disclosure describes the identification, isolation, sequencing and 
characterization of several human genes which interact with the presenilins, mutations 
5 in which may lead to FAD. These presenilin-interacting protein genes may be 
involved in the pathways which, when affected by mutant presenilins, lead to the 
development of Alzheimer's Disease. In addition, mutations in the presenilin- 
interacting protein genes, even in the absence of defects in the presenilins, nlay be 
causative of Alzheimer's Disease. 

10 Summary of the Invention 

The present invention is based, in part, upon the identification, isolation, 
sequencing and characterization of several human genes, referred to herein as 
"presenilin-interacting protein genes" or "PS-interacting protein genes." The products 
of these genes are believed to interact in vivo with the human presenilin-l proteins 

15 and, therefore, are implicated in the biochemical pathways which are affected in 

Alzheimer's Disease, Each of these genes, therefore, presents a new therapeutic target 
for the treatment of Alzheimer's Disease. In addition, PS-intera:cting protein nucleic 
acids, PS-interacting proteins and peptides, antibodies to the PS -interacting proteins, 
cells transformed with PS-interacting protein nucleic acids, and transgenic animals 

20 altered with PS-interacting protein nucleic acids, all possess various utilities, as 

described herein, for the diagnosis, therapy and continued investigation of Alzheimer's 
Disease and related disorders. 

Thus, it is one object of the invention to provide isolated nucleic acids 
encoding at least a PS-interacting domain of a PS-interacting protein. These PS- 
25 interacting proteins include mammalian S5a subunits of the 26S proteasome, the 
GT24 protein, the p0071 protein, the Rabl 1 protein, the retinoid X receptor-p, the 
cytoplasmic chaperonin, and several sequences identified herein as clones Y2H35, 
Y2H171, and Y2H41 , Preferred nucleotide and amino acid sequences are provided 
herein. It is another object of the invention to provide probes and primers for these 
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PS-interacting protein genes, and to provide nucleic acids which encode small 
antigenic determinants of these genes. Therefore, prefeired embodiments include 
sequences of at least 1 0. 1 5 or 20 consecutive nucleotides selected from the disclosed 
sequences. 

Using the nucleic acid sequences and antibodies disclosed and enabled 
herein, methods for identifying allelic variants or heterospecific homologues of a 
human PS-interacting protein and gene are provided. The methods may be practiced 
using nucleic acid hybridization or amplification techniques, immunochemical 
techniques, or any other technique known in the art. The allelic variants may include 
other normal human alleles as well as mutant alleles of the PS-interacting protein 
genes which may be causative of Alzheimer's Disease. The heterospecific 
homologues may be from other mammalian species, such as mice, rats, dogs, cats or 
non-human primates, or may be from invertebrate species, such as Drosonhila or C 
eiegans. Thus, it is another object of the invention to provide nucleic acids which 
encode allelic or heterospecific variants of the disclosed sequences, as well as the 
allelic or heterospecific proteins encoded by them. 



The it another object of the invention to provide vectors, and particularly 
expression vectors, which include any of the above-described nucleic acids. It is a 
further object of the invention to provide vectors in which PS-interacting protein 
nucleic acid sequences are operably joined to exogenous regulatory regions to produce 
altered patterns of expression, or to exogenous coding regions to produce fusion 
proteins. Conversely, it is another object to provide nucleic acids in which PS- 
interactiiig protein regulatory regions are operably joined to exogenous coding 
regions, including standard marker genes, to produce constructs in which the 
25 regulation of PS-interacting protein genes may be studied and used in assays for 
therapeutics. 

It is another object of the invention to provide host cells and transgenic 
animals which have been transformed with any of the above-described nucleic acids 
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of the invention. The host cells may be prokaryotic or eukaryotic cells and, in 
particular, may be gametes, zygotes, fetal cells, or stem cells useful in producing 
transgenic animal models. 

In particularly prefenred embodiments, the present invention provides a 
5 non-human animal model for Alzheimer's Disease, in which the genome of the 
animal, or an ancestor thereof, has been modified by at least one recombinant 
construct which has introduced one of the following modifications: (1) insertion of 
nucleotide sequences encoding at least a functional domain of a heterospecific normal 
PS-interacting protein, (2) insertion of nucleotide sequences encoding at least a 
10 fiinctional domain of a heterospecific mutant PS-interacting protein, (3) insertion of 
nucleotide sequences encoding at least a functional domain of a conspecific 
homologue of a heterospecific mutant PS-interacting protein, and (4) inactivation of 
an endogenous PS-interacting protein gene. Preferred transgenic animal models are 
rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non- 
15 human primates, but invertebrates are also contemplated for certain utilities. 

It is another object of the invention to provide methods for producing at 
least a fimctional domain of a PS-interacting protein using the nucleic acids of the 
invention. In addition, the present invention also provides substantially pure 
preparations of such proteins, including short peptide sequences for used as 
20 immunogens. Thus, the invention provides peptides comprising at least 10 or 15 

consecutive amino acid residues from the disclosed and otherwise enabled sequences. 
The invention further provides substantially pure preparations of peptides which 
comprise at least a PS-interacting domain of a PS-interacting protein, as well as 
substantially pure preparations of the entire proteins. 

25 Using the substantially pure peptides and proteins enabled herein, the 

invention also provides methods for producing antibodies which selectively bind to a 
PS-interacting protein, as well as cell lines which produce these antibodies. 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



10 



15 



20 



25 



Another object of the present invention is to provide methods of 
identifying compounds which may have utility in the treatment of Alzheimer's 
Disease and related disorders. These methods include methods for identifying 
compounds which can modulate the expression of a PS-interacting protein gene, 
methods for identifying compounds which can selectively bind to a PS-interacting 
protein, and methods of identifying compounds which can modulate activity of a PS- 
interacting protein. These methods may be conducted in vitro or in vivo , and may 
employ the transformed cell lines and transgenic animal models of the invention. The 
methods also may be part of a clinical trial in which compounds identified by the 
methods of the invention are further tested in human subjects. 

It is another object of the invention to provide methods of diagnosing or 
screening for inherited forms of Alzheimer's Disease by detennining if a subject bears 
a mutant PS-interacting protein gene. Mutant PS-interacting genes may be detected 
by assays including direct nucleotide sequencing, probe specific hybridization, 
restriction enzyme digest and mapping, PGR mapping, ligase-mediated PGR 
detection, RNase protection, electrophoretic mobility shift detection, or chemical 
mismatch cleavage. Alternatively, mutant forms of a PS-interacting protein may be 
detected by assays including immunoassays, protease assays, or electrophoretic 
mobility assays. 

It is also an object of the invention to provide pharmaceutical preparations 
which may be used in the treatment of Alzheimer's Disease and related disorders 
which resuh from aberration in biochemical pathways involving the PS-interacting 
proteins disclosed and enabled herein. Thus, the present invention also provides 
pharmaceutical preparations comprising a substantially pure PS-intemcting protein, an 
expression vector operably encoding a PS -interacting protein, an expression vector 
operably encoding a PS-interacting protein antisense sequence, an antibody which 
selectively binds to a mutant PS-interacting protein, or an antigenic determinant of a 
mutant PS-interacting protein. These phamiaceutical pr^arations may be used to 
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treat a patient bearing a mutant PS-interacting protein gene which is causative of 
Alzheimer's Disease or related disorders. 

These an other objects of the present invention are described more fully in 
the following specification and appended claims, 

5 ^ Detailed Description of the Invention 

I. Definitions 

In order to facilitate review of the various embodiments of the invention, 
and an understanding of the various elements and constituents used in making and 
using the invention, the following definitions are provided for particular terms used in 

10 the description and appended claims: 

Presenilin. As used without further modification herein, the tenns 
"presenilin" or "presenilins" mean the presenilin- 1 (PSl) and/or the presenilin-2 (PS2) 
genes/proteins. In particular, the unmodified tenns "presenilin" or "presenilins" refer 
to the mammalian PSl and/br PS2 genes/proteins and, preferably, the human PSl 

15 and/or PS2 genes/proteins as described and disclosed in PCT Publication 
WO96/34099. 

Normal. As used herein with respect to genes, the tenn "normal" refers to 
a gene which encodes a normal protein. As used herein with respect to proteins, the 
term "normal" means a protein which performs its usual or normal physiological role 

20 and which is not associated with, or causative of, a pathogenic condition or state. 

Therefore, as used herein, the term "normal" is essentially synonymous with the usual 
meaning of the phrase "wild type." For any given gene, or corresponding protein, a 
multiplicity of normal allelic variants may exist, none of which is associated with the 
development of a pathogenic condition or state. Such normal allelic variants include, 

25 but are not limited to, variants in which one or more nucleotide substitutions do not 
result in a change in the encoded amino acid sequence. 

Mutant. As used herein with respect to genes, the term "mutant" refers to 
a gene which encodes a mutant protein. As used herein with respect to proteins, the 
term "mutant" means a protein which does not perform its usual or normal 
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physiological role and which is associated with, or causative of, a pathogenic 
condition or state. Therefore, as used herein, the tenn "mutant" is essentially 
synonymous with the tenns "dysfunctional," "pathogenic." "disease-causing," and 
"deleterious." With respect to the presenilin and presenilin-interacting protein genes 
' and proteins of the present invention, the term "mutant" refers to genes/proteins 
bearing one or more nucleotide/amino acid substitutions, insertions and/or deletions 
which typically lead to the development of the symptoms of Alzheimer's Disease 
and/or other relevant inheritable phenotypes (e.g. cerebral hemorrhage, mental 
retardation, schizophrenia, psychosis, and depression) when expressed in humans. 
This definition is understood to include the various mutations that naturally exist, 
including but not limited to those disclosed herein, as well as synthetic or recombinant 
mutations produced by human intervention. The tenn "mutant." as applied to these 
genes, is not intended to embrace sequence variants which, due to the degeneracy of 
the genetic code, encode proteins identical to the normal sequences disclosed or 
otherwise enabled herein; nor is it intended to embrace sequence variants which, 
although they encode different proteins, encode proteins which are functionally 
equivalent to normal proteins. 

Substantially pure As used herein with respect to proteins (including 
antibodies) or other preparations, the term "substantially pure" means that the 

preparation is essentially free of other substances to an extent practical and 
appropriate for its intended use. In particular, a protein preparation is substantially 
pure if it is sufficiently fi-ee from other biological constituents so as to be usefi.1 in. for 
example, generating antibodies, sequencing, or producing pharmaceutical 
preparations. By techniques well known in the art. substantially pure proteins or 
peptides may be produced in light of the nucleic acid and amino acid sequences 
disclosed herein. In particular, in light of the nucleic acid and amino acid sequences 
disclosed herein, one of ordinary skill in the art may. by application or serial 
application of well-known methods including HPLC or immuno-affinity 
chromatography or electrophoretic separation, obtain proteins or peptides of any 
generally feasible purity. Preferably, but not necessarily, "substantially pure" 
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preparations include at least 60% by weight (dry weight) the compound of interest. 
More preferably the preparation is at least 75% or 90%, and most preferably at least 
99%, by weight the compound of interest. Purity can be measured by any appropriate 
method, e.g., column chromatography, gel electrophoresis, or HPLC analysis. With 
5 respect to proteins, including antibodies, if a preparation includes two or more 

different compounds of interest (e.g., two or more different antibodies, immunogens, 
functional domains, or other polypeptides of the invention), a "substantially pure" 
preparation is preferably one in which the total weight (dry weight) of all the 
compounds of interest is at least 60% of the total dry weight. Similarly, for such 

10 preparations containing two or more compounds of interest, it is preferred that the 
total weight of the compounds of interest be at least 75%, more preferably at least 
90%, and most preferably at least 99%, of the total dry weight of the preparation. 
Finally, in the event that the protein of interest is mixed with one or more other 
proteins (e.g., serum albumin) or compounds (e.g., diluents, excipients, salts, 

15 polysaccharides, sugars, lipids) for purposes of administration, stability, storage, and 
the like, such other proteins or compounds may be ignored in calculation of the purity 
of the preparation. 

Isolated nucleic acid. As used herein, an "isolated nucleic acid" is a 
ribonucleic acid, deoxyribonucleic acid, or nucleic acid analog comprising a 

20 polynucleotide sequence that is isolated or separate from sequences that are 

immediately contiguous (one on the 5' end and one on the 3' end) in the naturally 
occurring genome of the organism from which it is derived, The term therefore 
includes, for example, a recombinant nucleic acid which is incorporated into a vector, 
into an autonomously replicating plasmid or virus, or into the genomic DNA of a 

25 prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA or a 
genomic DNA fragment produced by PGR or restriction endonuclease treatment) 
independent of other sequences. It also includes a recombinant DNA which is part of 
a hybrid gene encoding additional polypeptide sequences and/or including exogenous 
regulatory elements. 
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Substantiailv identical seouenrp As used herein, a "substantially 
identical" amino acid sequence is an amino acid sequence which differs only by 
conservative amino acid substitutions, for example, substitution of one amino acid for 
another of the same class (e.g.. vahne for glycine, arginine for lysine, etc.) or by one 
5 or more non-conservative substitutions, deletions, or insertions located at positions of 
the amino acid sequence which do not destroy the function of the protein (assayed, 
e.g., as described herein). Preferably, such a sequence is at least 85%, more 
preferably 90%, and most preferably 95% identical at the amino acid level to the 
sequence of the protein or peptide to which it is being compared. For nucleic acids, 
10 the length of comparison sequences will generally be at least 50 nucleotides, 

preferably at least 60 nucleotides, more preferably at least 75 nucleotides, and most 
preferably 1 10 nucleotides. A "substantially identical" nucleic acid sequence codes 
for a substantially identical amino acid sequence as defined above. 

Transformed cell. As used herein, a "transformed cell" is a cell imo which 
(or into an ancestor of which) has been introduced, by means of recombinant DNA 
techniques, a nucleic acid molecule of interest. The nucleic acid of interest will 
typically encode a peptide or protein. The transformed cell may express the sequence 
of interest or may be used only to propagate the sequence. The term "transformed" 
may be used herein to embrace any method of introducing exogenous nucleic acids 
including, but not limited to, transfomiation, transfection. electroporation, 
microinjection, viral-mediated transfection. and the like. 

Operably ioined As used herein, a coding sequence and a regulatory region are 
said to be "operably joined" when they are covalently linked in such a way as to place 
the expression or transcription of the coding sequence under the influence or control 
of the regulatory region. If it is desired that the coding sequences be translated into a 
functional protein, two DNA sequences are said to be ope,ably jomed if induction of 
promoter function results in the transcription of the coding sequence and if the nature 
of the linkage between the two DNA sequences does not (1) result in the introduction 
of a frame-shift mutation, (2) interfere with the ability of the regulatory region to 
direct the transcription of the coding sequences, or (3) interfere with the ability of the 
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coiresponding RNA transcript to be translated into a protein. Thus, a regulatory 
region would be operably joined to a coding sequence if the regulatory region were 
capable of effecting transcription of that DNA sequence such that the resulting 
transcript might be translated into the desired protein or polypeptide. 
5 Stringent hybridization conditions Stringent hybridization conditions is a tenn 

of art understood by those of ordinary skill in the art. For any given nucleic acid 
sequence, stringent hybridization conditions are those conditions of temperature, 
chaotrophic acids, buffer, and ionic strength which will pemit hybridization of that 
nucleic acid sequence to its complementary sequence and not to substantially different 
10 sequences. The exact conditions which constitute "stringent" conditions, depend upon 
the nature of the nucleic acid sequence, the length of the sequence, and the frequency 
of occurrence of subsets of that sequence within other non-identical sequences. By 
varying hybridization conditions from a level of stringency at which non-specific 
hybridization occurs to a level at which only specific hybridization is observed, one of 
15 ordinary skill in the art can, without undue experimentation, determine conditions 
which will allow a given sequence to hybridize only with complementary sequences. 
Suitable ranges of such stringency conditions are described in Krause and Aaionson 
( 1 99 1 ). Hybridization conditions, depending upon the length and commonality of a 
sequence, may include temperatures of 20''C-65''C and ionic strengths from 5x to O.lx 
20 SSC. Highly stringent hybridization conditions may include temperatures as low as 
40-42''C (when denaturants such as formamide are included) or up to dO-eS'C in ionic 
strengths as low as O.lx SSC. These ranges, however, are only illustrative and, 
depending upon the nature of the target sequence, and possible future technological 
developments, may be more stringent than necessary. Less than stringent conditions 
25 are employed to isolate nucleic acid sequences which are substantially similar, allelic 
or homologous to any given sequence. 

Selectively binds. As used herein with respect to antibodies, an antibody 
is said to "selectively bind" to a target if the antibody recognizes and binds the target 
of interest but does not substantially recognize and bind other molecules in a sample, 
e.g., a biological sample, which includes the target of interest. That is, the antibody 
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must bind to its target with sufficient specificity so as to distinguish the target from 
essentially all of molecules which would reasonably be present in a biological sample 
including the target. 

The Presenilins and Pres^^n jlin-Interartinp Prnt>.;^o 
5 The present invention is based, in part, upon the discovery of a family of 

mammalian genes which, when mutated, are associated with the developmem of 
Alzheimer's Disease. The discovery of these genes, designated presenilin-1 (PSl) and 

presenilin-2 (PS2). as well as the characterization of these genes, their protein 
products, mutants, invertebrate homologues. and possible functional roles, are 
10 described in PCT Publication WO96/34099. The present invention is further based, in 
part, upon the discovery of a group of proteins which interact with the presenilins 
under physiological conditions and which, thereforb. are believed to be involved in the 
biochemical pathways which are altered in Alzheimer's Disease. These proteins are 
referred to herein as presenilin-interactirig (PS-interacting) proteins. Because 
15 mutations in the presenilins are known to be causative of Alzheimer's Disease, each of 
the PS-interacting genes and proteins disclosed and described herein presents a novel 
target for dierapeutic intervention in Alzheimer's Disease. That is. modulation of the 
interactions of these proteins with the presenilins. or modulation of the interactions of 
at least the PS-interacting domains ofdiese PS-interacting proteins with at least the 
10 interacting domains of the presenilins. prx,vides a means of modulating the activity 
and/or availability of the presenilins, or of modulating the activity and/or availability 
of the PS-interacting proteins. Furtheraiore, as aberrations in the interactions of 

mutant presenilins with one or more of these PS-interacting proteins is causative of 
Alzheimer's Disease, mutatioris in one or more of these PS-interacting proteins are 

5 also likely to be causative of Alzheimer's Disease. Therefore, each of the PS- 
interacting genes and proteins disclosed and described herein presents a novel target 
for diagnosis of fomis of familial and/or sporadic Alzheimer's Disease with an 
etiology independent of mutations in the presenilins. Finally, as described more fully 
below, the PS-interacting genes and proteins described and disclosed herein provide 

' for new assays for compounds which affect the interactions of the pr^enilins and PS- 
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interacting proteins, assays for other members of the biochemical pathways involved 
in the etiology of Alzheimer's Disease, and new cell lines and transgenic animal 
models for use in such assays. 

5 1. Preseriilin Processing 

Employing the antibodies and protein-binding assays described and/or 
enabled in PCX Publication WO96/34099, the processing and protein-protein 
interactions of both normal and mutant preseniiins were investigated. It was found 
that mutations in the preseniiins appear to lead to changes in both their intracellular 
10 processing (e.g., endoproteolytic cleavage, ubiquitination, and clearance) and their 
intracellular interactions with other proteins expressed in human brain. As described 
below, knowledge of presenilin processing and interactions, and particularly changes 
in mutant presenilin processing and interactions, provides for new diagnostic and 
therapeutic targets for Alzheimer's Disease and related disorders. 

15 Western blot analysis suggests that the normal preseniiins undergo 

proteolytic cleavage to yield characteristic N- and C-terminal fragments. As noted 
above, the normal presenilin proteins have an expected molecular mass of 47-5 1 kDa 
depending, in part, upon mRNA splice variations, electrophoretic conditions, etc. 
Analysis of Western blots suggests, however, that the normal presenilin proteins 

20 undergo proteolytic cleavage to yield an approximately 35 kDa N-terminal fragment 
and an approximately 18 kDa C-terminal fragment. In particular. Western blots 
bearing lysates from wild-type native human fibroblasts, human neocortical brain 
tissue from control subjects, and neocortical brain tissue from non-transgenic and PSl 
transgenic mice using antibodies ("14.2") recognizing PSl-specific residues 1-25 at 

25 the N-terminus reveal the presence of a strong inununoreactive band of approximately 
35 kDa and, after longer exposures, a weaker band of approximately 45 kDa which 
presumably represents the full-length PSl protein. Antibodies ("520") directed at 
residues 304-318 at the apex of the TM6-^7 loop of PSl, and antibodies ("4627") 
directed at residues 457-467 in the C-terminus of PSl, both recognize the same strong 

30 band of approxiinately 18 kDa. Antibodies 520 also recognize a weak band of 45 kDa 
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coincident with the PS 1 band detected by 14.2. Sequencing of the major C-tenninal 
fragment from PSl-transfected human embryonic kidney cells (HEK 293) showed 
that the principal endoproteolytic cleavage occurs near M298 in the proximal portion 
of the TM6-.7 loop, possibly by enzymes other than the proteasome. These 
5 observations suggest that an endoproteolytic cleavage event occurs near the junction 

ofexons9andl0ofPSl. FuUlength PS I in these cells is quickly named over (t., < 
60 min.) by the proteasome. 

To determine whether mutations in the presenilin proteins result in 
alterations of their proteolytic cleavage. Western blots containing lysates of fibroblast 
10 and neocortical brain homogenates from nomial subjects and subjects carrying PSl 
mutations were investigated with the PSl specific antibody Ab 14.2. In fibroblasts, 
there were no obvious differences in the relative intensities of the protein bands whin 
lysates from heterozygous carriers of the PSl mutations were compared with nomial 
homozygotes. In contrast, there appeared to be a difference between PSl mutation 
15 «^^ers and normals in homogenates oftemporal neocortex from AD affected 

heterozygous earner, of either the PSl A246E or C4I0Y mutations (which are located 
in TM6 and TM7 respectively). In heteiozygotes. a strongly immunoreactive band of 
approximately 45 kDa was detected which initially appeared to correspond to the M\- 
length PSl protein. Further analysis, however, revealed that this band represents an 
0 alternatively processed presenilin product. A similar band corresponding to this 
mutant processed PSl was observed in neocortical homogenates from some sporadic 
late-onset AD patients. These data suggest that (1) some pathogenic PSl mutations 
associated with early-onset AD alter the way in which the presenilins are processed 
through endoproteolytic and proteasome pathways and (2) the presenilin proteins, and 
changes in the processing of the presenilins in the brain, are also implicated in late- 
onset and sporadic AD, 

2/ Presenilin^TntprarHn g Proteinc: 

In order to identify proteins which may bind to or otherwise interact with 
the presenilins mvivo, a yeast two-hybrid system was used as described below 
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(Example 1 ). In particular, because mutations in the TM6->7 loop domains are 
known to be causative of AD, a'yeast two-hybrid system was used to identify cellular 
proteins which may interact with normal and mutant presenilin TM6-^7 loop 
domains. Yeast two-hybrid studies were also done with cDNAs corresponding to the 
5 C^lerminal 18 kDa endoproteolytic cleavage fragment, and with cDNAs 

corresponding to the TMl-^2 intraluminal loop domain, which is also the site of the 
FAD associated Yl 15H missense mutation. In brief, cDNA sequences encoding the 
TM6->7 loop (i.e., residues 266 to 409 of PSl) were ligated in-frame to the GAL4 
DNA-binding domain in the pAS2-l yeast expression plasniid vector (Clontech). 
10 This plasmid was then co-transformed into S. cerevisiae strain Yl 90 together with a 
library of human brain cDNAs ligated into the pACT2 yeast expression vector bearing 
the GAL4 activation domain (Clontech). After appropriate selection and re-screening, 
a number of clones were recovered and sequenced bearing human brain cDNAs 

encoding peptides which interacted with the noraial presenilin TM6^7 domain. To 

15 determine whether these presenilin interactions would be modified by AD related 
mutations within the TM6-*7 loop, the yeast two-hybrid system was again used with 
TM6-*7 loop peptides containing the L286V, the L3 92y , and the exon 1 0 sphging 
mutants. When these mutant constructs were used as "bait" to re-screen the brain 
cDNA:GAL4 activation domain library, some but not all of the brain cDNA 

20 sequences which interacted with the normal presenilin were recovered. In addition, 
several new clones were identified which interacted with the mutant but not the 
normal presenilins. The clones corresponding to the PS-interacting proteins with the 
highest presenilin affinity are described in Example 1 and below. 

PS-interacting proteins, particularly those which interact selectively with 

25 either the normal or mutant presenilins, provide new targets for the identification of 
useful phanmaceuticals, new targets for diagnostic tools in the identification of 
individuals at risk, new sequences for the production of transformed cell lines and 
transgenic animal models, and new bases for therapeutic intervention in Alzheimer's 
Disease. In particular, the onset of AD may be associated with aberrant interactions 

30 between mutant presenilin proteins and normal forms of PS-interacting proteins such 
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as those identified using the methods described herein. These changes may increase 
or decrease interactions present with normal PSl or may cause interaction with a 
novel mutation-specific PS-interacting protein. In addition, however, aberrant 
interactions may resuh from normal presenilins binding to mutant forms of the PS- 
i interacting proteins and, therefore, mutations in the PS-interacting proteins may also 
be causative of AD. 

A- The S5a Su hunit of thp. 76S Proteasnme 

Two overlapping clones have been identified as representing a portion of 
the human protein alternatively known as Antisecretory Factor ("ASF") or the 
Multiubiquitin chain-binding S5a subunit of the 26S proteasome ("S5a"). These 
clones, which together include residues 70-377 of S5a. were shown to interact with 
the normal presenilin TM6-h>7 loop domain but only weakly with two TM6-^7 loop 
domain mutants tested (L286V, L392V). The PS I :S5a interaction was confirmed by 
co-immunoprecipitation studies, and immunocytochemical studies showed S5a and 
PSl are expressed in contiguous intracellular compartments in brain cells typically 
affected by AD. 

The interaction between PSl and the proteasome coiild be relevant to the 
pathogenesis of Alzheimer's Disease (AD) through several possible mechanisms. 
First, most mammalian cells seem to maintain very low levels of the PSl holoprotein. 
A notable exception to this are cells expressing the PSl A290-319 splicing mutation, 
which results in a mutant PSl holoprotein which is not endoproteolytically cleaved ' 
and which is. therefore, readily detectable. In the case of the A290-3 19 splicing 
mutation at least, the presence of the mutant PSl holoprotein, or the absence or 
reduction in the 35 kDa N-temainal and 1 8 kDa C-terminal fragments, appears 
sufficient to cause AD. It is possible, therefore, that even very subtle changes in the 
turnover of the mutant PS 1 holoprotein might have significant pathophysiological 
effects. Thus, mutations in either the presenilins or S5a which perturb the PSl :S5a 
interaction in the mammalian CNS may cause the presenilin holoprotein to be 
aberrantly processed and cause AD. Therefore, modulation of presenilin proteolytic 
pathways might be applied therapeutically to enhance removal of mutant holoprotein: 
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To assess a potential in vivo relationship between PSl and the S5a subunit 
of the 26S proteasome, the effects of proteasome inhibitors on PSl metabolism were 
investigated. Short term organotypic cultures of neonatal rat hippocampus and 
carcinoma of colon (CaCo2) cells (which express high levels of both PSl and PS2) 
5 were administered either the specific, reversible proteasome inhibitor N-acetyl- 

leucinyl-leucinyl-norleucinyl-H (LLnL) (Rock et aL, 1994), or the specific irreversible 
proteasome inhibitor lactacystin (Fenteany et al., 1995). Both agents caused an 
increase in the steady state levels of PSl holoprotein. Both agents also prolonged the 
half-life of the PSl holoprotein in pulse chase experiments in hippocampal slices from 

10 - 1 5 minutes to -35 minutes. As noted above, the PS 1 holoprotein appears to be 
rapidly turned over in normal cells. However, even after four hours of metabolic 
labelling, neither of the proteasome inhibitors affected the level of the 35 kDa N- 
terminal PSl fragment, or resulted in the appearance of novel species. These studies 
imply that the majority of the PSl holoprotein is catabolized directly via a rapid, 

15 proteasome dependent pathway in a manner similar to several other integral 

membrane proteins (e.g. Sec61 and CFTR). On the other hand, because the -35 kDa 
and - 1 8 kDa terminal fragments are still produced in the presence of proteasome 
inhibitors, this endoproteolytic cleavage of PS 1 is probably not mediated by the 
proteasome pathway. Therefore, it appears that at least two proteolytic pathways act 

20 upon the PSl holoprotein. 

An alternate possibility is that mutant PS 1 :S5a interactions may modify 
the function or the cellular regulation of S5a. To address this possibility, S5a levels 
were examined by Western blotting of lysates from postmortem temporal neocortex 
from non-AD neurologic controls (n = 8), sporadic AD (n = 8) and PSl -linked FAD 

25 (n = 4). In the majority of non-AD brains, polyclonal anti-S5a antibodies specifically 
detected an S5a species with Mr of - 50 kDa, which could be abolished by 
preabsorption of the antibody with recombinant His^-SSa or with extracts of mvc -S5a 
transfected cells. In a subset of these control cases an additional S5a reactive band 
was observed at --34 kDa. In contrast, in tissue from all subjects with sporadic late 

30 onset AD, the predominant S5a reactive species was observed at - 40 kDa which was 
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not seen .n control tissue. The origin, and the functional significance of this altered 
electrophoretic mobility is unclear but indicates that S5a processing is altered in AD 
brains, irrespective of whether the AD is presenilin-linked or sporadic. 

Thus, the presenilin-proteasome interaction appears significant in several 
5 respects. First, the facts that the normal presenilin TM6-.7 loop domain mteracts 
with the S5a protein, that the mutant presenilin TM6^7 loop domains fail to interact 
(or mteract very weakly) with the S5a protein, that presenilins bearing mutations m 
the TM6->7 loop domain appear to be differently cleaved and multiubiquitinated that 
proteasomes are known to be involved in the cleavage and clearance of a variety of 
10 P'-o'eins (particularly multiubiquitinated proteins), that inhibition of proteasome 
activity inhibits cleavage of the presenilin holoproteins. and that S5a processing is 
altered in AD brains, all suggest (1) that the S5a subunit and the 26S proteasome are 
involved in the normal processing of the presenilins and that mutations which disr.pt 
this normal interaction may be responsible for the abnomial processing observed in 
15 TM6-.7 loop domain mutants; or (2) that Ae presenilin-proteasome interaction may 

modulate the activity of PS 1 . S5a. or both, with or without involving proteasome- 
mediated presenilin processing; or (3) that modulation of the normal quality control 
function of proteasome-mediated degradation of misfolded or mutant membrane 
proteins trafficking through the ER and Golgi (such as APP. Notch, or Prion proteins) 
20 and of misfolded. mutant, or ubiquitinated cytoplasmic pn^teins (including structural 

protems such as tau, and short lived, proteasome processed signaling molecules such 
as NFkB). Thus, defective proteasome function might selectively cause these protems 

(especially PAPP. tau. Prion) to be aberrantly metabolized. TT.e latter would lead to 
the accumulation of neurotoxic, amyloidogenic protease-resistant derivatives such as 

^ AP and PrPsc. the accumulation of neurofibrillar, tangles, and defective intracellular 
signalmg fimctions. In suppon of these hypotheses, it should be noted that failure to 
clear hyperubiquitinated phosphorylated tau and other microtubule associated proteins 
IS a prommenl feature of Alzheimer's Disease (Kosik and Greenberg, 1994) 
suggesting a possible link between TM6-.7 loop domain mutants, p^enilin- 

0 P-'-^o- interactions, tau-proteasome interactions, and the neurofibrillary t^^^^^ 
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of tau protein in AD brains. Finally, proteasomes are known to be capable of 
degrading APP and of binding the AP peptides which are associated with Alzheimer^s 
Disease, suggesting a possible link between TM6->7 loop domain mutants, 
presenilin-proteasome interactions, APP-proteasome interactions, and the amyloid 
5 plaques characteristic of AD brains. Furthermore, administration of proteasome 
inhibitors such as LLnL and Lactacystin cause severe disturbances in pAPP 
mejtabolism with increases in intracellular immature N-glycosylated pAPP, and the 
secretion of much larger amounts of AP42 isoforms into the media (Klafki, et al., 
1996). 

10 Therefore, presenilin processing and the presenilin-proteasome interaction 

are clear targets for the diagnosis as well as therapeutic intervention in AD. Thus, as 
described below, assays may now be provided for drugs which affect the proteasome- 
mediated cleavage of the presenilins, which affect the alternative endoproteolytic 
cleavage and ubiquitination of the mutant presenilins, or which otherwise affect the 

15 processing and trafficking of the presenilins or the S5a subunit of the proteasome. In 
addition, as mutations in the 26S proteasome which disrupt the normal processing of 
the presenilins are likely to be causative of Alzheimer's Disease, additional diagnostic 
assays are provided for detecting mutations in the S5a or other subunits of the 
proteasome. Finally, additional transfomned cell lines and transgenic models may 

20 now be provided which have been altered by the introduction of a normal or mutant 
sequence encoding at least a functional domain of the proteasome. The appearance of 
abnormal electrophoretic forms of S5a (and/or other proteasome subunits) in biologic 
tissues and fluids can be used as a clinical test for diagnosis and monitoring of disease 
activity in subjects with sporadic forms of AD. 

25 B. GT24: A Protein with "Airnadillo" Repeats 

Another PS-interacting protein, designated GT24, was identified from 
several over-lapping clones obtained using a PSI26M09 domain as bail in the yeast two- 
hybrid system and a human adult brain cDNA library. Six longer GT24 clones of 
-3.8 kb in size were subsequently obtained by screening of conventional cDNA 

30 libraries. The open reading frame within the longest GT24 clone obtained to date 
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(Accession number U81004) suggests that GT24 is a protein of at least 1040 amino 
acids with a unique N-terminus, and considerable homology to several armadillo 
(arm) repeat proteins at its C-terminus. Thus, for example, residues 440-862 of GT24 
(numbering from Accession number U81004) have 32-56% identity (p=l.2c"') to 
5 residues 440-854 of murine pi 20 protein (Accession number Z 1 7804), and residues 
367-815 of GT24 have 26-42% identity (p=0.001 7) to residues 245-465 of the a 
melanogaster amiadillo segment polarity protein (Accession number P 1 8824). The 
GT24 gene maps to chromosome 5pI5 near the anonymous microsatellite marker 
D5S748 and the Cri-du-Chat syndrotne locus. 

10 "y''"^i2^''0"0f"nique 5' sequences of GT24 to Northeni blots re^^^^ 

that the GT24 gene is expressed as a range of transcripts varying in size between -3.9 
and 5.0 kb in sev^al regions of human brain, and in several non-neurologic tissues 
such as heart. In addition, rnsitu hybridization studies using a 289 bp single copy 
fragment from the 5' end of GT24 in four month old murine brain reveal GT24 
transcription closely parallels that of PSl. with robust expression in dentate and 
hippocampal neurons, in scattered neocortical neurons, and in cerebellar Purkinje 
cells. In day E13 murine embryos. GT24 is widely expressed at low levels, but is 
expressed at somewhat higher levels in somites and in the neural tube. A 
physiological invivo interaction between GT24 and PSl is supported by co- .. 
immunoprecipitation studies in HEK293 cells transiently transfected with a wild type 
human PSl cDNA, a c^-tagged cDNA encoding residues 484-1040 of GT24 
(including the C-terminal Mm repeats), or both cDNAs. Cell lysates were 
immunoprecipitated with anti-PSl antibodies and then investigated for the presence of 
the mYc,GT24 protein by immuno-blotting. In PSl/mYc-GT24 double transfected 
cells, the immunoprecipitates contained a robust anti-myc reactive band of Mr -60 
kDa, which co-migrated with a myc-GT24 control. In cells transfected with myc- 
GT24 only, a very weak band was detected after long exposures, presumably 
renecting interaction of the niyc-GT24 with low levels of endogenous PS I . No myc- 
reactive bands were detected in cells transfected with PSl alone, or in any of the 
transfected cells immunoprecipitated with pre-immune serum. Taken together, these 
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observations strongly suggest that the observed PS 1 :GT24 interaction is 
physiologically relevant. 

To explore whether mutations in the TM6-TM7 loop of PSl might 
influence the PSl :GT24 interaction, we employed quantitative liquid P-galactosidase 
5 assays to directly compare the yeast-two-hybrid interaction of the C-terminal residues 
499-1040 of GT24 with wildtype and mutant PSl24^09- These studies revealed that 
the interaction of GT244„.,o4o with a L286V mutant PSl domain was not significantly 
different from the interaction with the corresponding wild type PS 1 domain. In 
contrast, there was a significant reduction in the GT244„.,o4o interaction with the 
10 L392V mutant PS 1 construct. The absence of an effect of the L286V mutation, and 
the presence of an effect with the L392V mutation, may suggest that some mutations 
may effect PSl :GT24 binding, while others may modulate the PS 1 response to GT24 
binding. 

The PS I :GT24 interaction could support several functions. The arm repeat 
15 motif of GT24 has been detected in several proteins with diverse functions including 
P-catenin and its invertebrate homologue armadillo, plakoglobin, p 1 20, the 
adenomatous polyposis coli (APC) gene, suppressor of RNA polymerase 1 in yeast 
(SRPl), and smGDS. For example, p-catenin, pi 20 and plakoglobin play an essential 
role in intercellular adhesion, fl-catenin/ armadi 1 lo is involved in transduction of 
20 winglessAVnt signals during cell fate specification, and P-catenin and pi 20 may play 
a role in other receptor mediated signal transduction events including responses to 
trophic factors such as PDGF, EGF, CSF-1 and NGF. 

If the PSl :GT24 interaction is part of intercellular signahng pathways for 
trophic factors, or is involved in cell-cell adherence, disruption of the interaction may 
25 be involved in the neurodegenerative processes in PS-linked FAD brains, and in the 
increased sensitivity of PSl or PS2 transfected cells to apoptosis (Wolozin et al., 
1996). It is of note that at least one arm protein, smGDS. stimulates GDP/GTP 
exchange on intracellular G-proteins (Kikuchi et al. 1992; Borguski et aL, 1993), and 
that mutant forms of both PAPP and PS2 are thought to activate programmed cell 
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death pathways through mechanisms involving heterotrimeric GTP/GDP proteins 
(Wolozin et al, 1996; Okamoto, at al., 1995; Yamatsuji. et al, 1996). 

The interaction between PS 1 and GT24 may also be involved in some of 
the developmental phenotypes associated with homozygous PSl knockouts in mice 
5 such as failed somitogenesis of the caudal embryo, short tail, and fatal cerebral 
hemorrhage at around day E13.5 (Wong et al., 1996). The resemblance of these 
skeletal phenotypes to those associated with null mutations in PA2a and Notch, and 
the apparent suppressor effect of mutations in sella on Notch/linl2 mediated 
signaling in C. elegans suggest that the PS proteins function in the Notch signaling 
10 pathway. In addition, mice homozygous for a knockout of the Wnt-3a gene (Takada 
et al., 1994). and murine homozygotes for a spontaneous mutation, "vestigial tail" or 
Yt, in the WntOa gene (Greco et al., 1996). have skeletal phenotypes of defective 
caudal somite and tail bud fomiation. The Wnt3a knockouts are embryonic lethal by 
day 12.5. These phenotypes are similar to those of homozygous knockouts of the 
murine PSl gene (Wong et al.. 1996). The observation that GT24 binds to PSl, is 
expressed in embryonic somites, and contains the armadillo repeat motif of other 
proteins used in the downstream signaling in the Wingless^ pathway suggests that 
PSl is a downstream element in the GT24 -WinglessAVnt pathway. This can be 
exploited to create a bioassay for diugs affecting the GT24.PS 1 interaction directly, or 
affecting upstream or downstream components of that interaction pathway, and can 
therefore be used to monitor the effects of presenilin mutations. For example, cells 
transfected with nonnal or mutant presenilins may be exposed to soluble Wnt-3a 
protein (or other Wnt proteins such as Wnt-1) and assayed for changes which are 
specific to the WinglessAVnt signaling pathway, or for any of the other changes 
described herein for cell assays (e.g., intracellular ion levels. Ap processing, 
apoptosis, etc.). 

Thus, the GT24 protein also presents new targets for diagnosis as well as 
therapeutic intervention in AD. For example, as mutations in the GT24 protein may 
also be causative of Alzheimer's Disease, additional diagnostic assays are provided for 
detecting mutations in these sequences. Similarly, additional transfomied cell lines 
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and transgenic models may now be provided which have been altered by introduction 
of a normal or mutant nucleic acid encoding at least a functional domain of the GT24 
protein, and particularly the functional domains (e.g., residues 70-377) which interact 
with the presenilins. Such transformed cells and transgenics will have utility in assays 
5 for compounds which modulate the presenilin-GT24 interactions. 

C. p0071: A Protein with "Armadillo" Repeats 

Another independent clone isolated in the initial screening with the wild 
type PS 1266^09 "bait" also encodes a peptide with C-terminal ann repeats (clone 
Y2H25, Accession number U81005). A longer cDNA sequence corresponding to the 

10 Y2H25 clone has been deposited with GenBank as human protein p007 1 (Accession 
number X8 1 889) and is reproduced herein as SEQ ID NO: 5. Clone Y2H25 
corresponds essentially to nucleotide positions 1 682-1 994 of SEQ ID NO: 5. 
Comparison of the predicted sequence of the Y2H25/p0071 ORF with that of GT24 
confirms that they are related proteins with 47% overall amino acid sequence identity, 

15 and with 70% identity between residues 346-862 of GT24 and residues 509-1022 of 
p0071 . This suggests that PSl interacts with a novel class of arm repeat containing 
proteins. The broad --4.5 kb hybridization signal obtained on Northern blots with the 
unique 5' end of GT24 could reflect either alternative splicing/polyadenylation of 
GT24 or, less likely, the existence of additional members of this family with higher 

20 degrees of N-terminal homology to GT24 than p0071 . Cells transformed with these 
sequences, or transgenic animals including these sequences, will have additional 
utility as animal models of AD and for use in screening for compounds which 
modulate the action of noraial and mutant presenilins. 

D, Rab 11 

25 One clone (Y2H9), disclosed herein as SEQ ID NO: 5, was identified as 

interacting with the normal PSl TM6->7 loop domain and appears to correspond to a 
known gene, Rabl 1, available through Accession numbers X56740 and X53143. 
Rabl 1 is believed to be involved in protein/vesicle trafficking in the ER/Golgi. Note 
the possible relationship to processing of membrane proteins such as BAPP and Notch 
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with resultant overproduction of toxic AB peptides (especially neurotoxic AB,^^,,, 
iso forms) (Scheuner, et al. 1 995). 

E. Retinoid X Receotor-P 

One clone (Y2H23b), disclosed herein as SEQ ID NO: 6, was identified as 
interacting with the normal PSl TM6->7 loop domain and appears to correspond to a 
known gene, known variously as the retinoid X receptor-p, nuclear receptor co- 
regulator, or MHC Class I regulatory element, and is available through Accession 
numbers M84820. X63522 and M81766. This gene is believed to be involved in 
intercellular signaling, suggesting a possible relationship to the intercellular signaling 
function mediated by Celegans selI2 and Notch/lin-I2 (transcription activator). 
F- Cytoplasmic Chapemnin 

One clone (Y2H27), disclosed herein as SEQ ID NO: 8, was identified as 
interacting with the normal PS 1 TM6-^7 loop domain and appears to correspond to a 
known gene, a cytoplasmic chaperonin containing TCP- 1 , available through 
15 Accession numbers UI7104 and X74801. 

G. Clone Y2H3S 

One clone (Y2H35), disclosed herein as SEQ ID NO: 7, was identified as 
interacting with the normal PSl TM6^7 loop domain and appears to correspond to a 
sequence that codes for a protein of unknown function, available through Accession 
number RI2984. but which displays evolutionary conservation in yeast sequences. 

H. Clone Y2H1 71 

One clone (Y2H171). disclosed herein as SEQ ID NO: 9, was identified as 
interacting with the nomial PS 1 TM6-^7 loop domain and appears to correspond to a 
known expressed repeat sequence available through Accession number D55326. 
25 I. Clone Y2H41 

One clone (Y2H41 ) was identified which reacts strongly with the TM6-^7 
loop domains of both PSl and PS2 as well as the mutant loop domains of PSl. The 
sequence, disclosed as SEQ ID NO: 10, shows strong homology to an EST.of 
unknown function (Accession number T64843). 



20 
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III. Preferred Embodiments 

Based, in part, upon the discoveries disclosed and described herein, the 
following preferred embodiments of the present invention are provided. 
5 . ■ 

1. Isolated Nucleic Acids 

In one series of embodiments, the present invention provides isolated 
nucleic acids corresponding to, or relating to, the nucleic acid sequences disclosed 
herein, which encode at least the PS-interacting domain of a PS-interacting protein. 

10 As described more fully below, the disclosed and enabled sequences include normal 
sequences from humans and other mammalian species, mutant sequences from 
humans and other mammalian species, homologous sequences from non-mammalian 
species such as Drosophila and C. elegans. subsets of these sequences useful as probes 
and PCR primers, subsets of these sequences encoding fragments of the PS-interacting 

15 proteins or corresponding to particular structural domains or polymorphic regions, 
complementary or antisense sequences corresponding to fragments of the PS- 
interacting protein genes, sequences in which the PS-interacting protein coding 
regions have been operably joined to exogenous regulatory regions, and sequences 
encoding fusion proteins in which portions of the PS-interacting proteins are fused to 

20 other proteins useful as markers of expression, as "tags" for purification, or in screens 
and assays for other proteins which interact with the PS-interacting proteins. 

Thus, in a first series of embodiments, isolated nucleic acid sequences are 
provided which encode at least a PS-interacting domain of a normal or mutant version 
of a PS-interacting protein. Examples of such nucleic acid sequences are disclosed 

25 herein as SEQ ID NOs: 1, 3, and 5-10, In addition, given the sequences of the PS- 
interacting domains of the PS-interacting proteins disclosed herein, one of ordinary 
skill in the art is clearly enabled to obtain the entire genomic or cDNA sequence 
encoding the entire PS-interacting proteins. Thus, for example, based upon the initial 
clone of the GT24 protein obtained using the yeast two-hybrid system (Example 1). 

30 the larger GT24 clone disclosed as SEQ ID NO: 3 was obtained by standard methods 
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known in the art. Complete cDNA or genomic clones of each of the genes encoding 
the disclosed sequences may be similarly obtained by one of ordinary skill in the art. 
Therefore, the present invention provides complete genomic sequences as well as 
cDNA sequences corresponding to the PS-interacting protein genes of the invention. 
Alternatively, the nucleic acids of the invention may comprise recombinant genes or 
"minigenes" in which all or some introns of the PS-interacting protein genes have 
been removed, or in which various combinations of introns and exons and local cis- 
acting regulatory elements have been engineered in propagation or expression 
constructs or vectors. For purposes ofreducing the size of a recombinant PS- 
interacting protein gene, a cDNA gene may be employed, or various combinations of 
introns and untranslated exons may be removed from a DNA constnict. These and 
many variations on these embodiments are now enabled by the identification and 
description of the PS-interacting proteins provided herein. 

In addition to the disclosed PS-interacting protein and gene sequences, one 
of ordinary skill in the art is now enabled to identify and isolate nucleic acids 
representing PS-interacling genes or cDNAs which are allelic to the disclosed 
sequences or which are heterospecific homologues. Thus, the present invention 
provides isolated nucleic acids conresponding to these alleles and homologues, as well 
as the various above-described recombinant constmcts derived from these sequences, 
by means which are well known in the art. Briefly, one of ordinary skill in the art 
may now screen preparations of genomic or cDNA. including saihples prepared from 
individual organisms (e.g.. human AD patients or their family members) as well as 
bacterial, viral, yeast or other libraries of genomic or cDNA, using probes or PCR 
primers to idemify allelic or homologous sequences. Because it is desirable to 
identify mutations in the PS-interacting proteins which may contribute to the 
development of AD or other disorders, because it is desirable to idemify 
polymorphisms in the PS -interacting proteins which are.not pathogenic, and because 
it is also desirable to create a variety of animal models which may be used to study 
AD and screen for potemial therapeutics, it is particularly contemplated that additional 
PS-interacting protein sequences will be isolated from other preparations or libraries 
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of human nucleic acids and from preparations or libraries from animals including rats, 
mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, and non-human 
primates. Furthermore, PS-interacting protein homologues from yeast or invertebrate 
species, including C. elegans and other nematodes, as well as Drosophila and other 
5 insects, may have particular utility for drug screening. 

Standard hybridization screening or PCR techniques may be employed (as 
used, for example, in the identification of the mPSl gene disclosed in PCT 
Publication WO96/34099) to identify and/or isolate such allelic and homologous 
sequences using relatively short PS-interacting protein gene sequences. The 

10 sequences may include 8 or fewer nucleotides depending upon the nature of the target 
sequences, the method employed, and the specificity required. Future technological 
developments may allow the advantageous use of even shorter sequences. With 
current technology, sequences of 9-50 nucleotides, and preferably about 18-24 are 
preferred. These sequences may be chosen from those disclosed herein, or may be 

15 derived from other allelic or heterospecific homologues enabled herein. When 
probing mRNA or screening cDNA libraries, probes and primers from coding 
sequences (rather than introns) are preferably employed, and sequences which are 
omitted in alternative splice variants typically are avoided unless it is specifically 
desired to identify those variants. Allelic variants of the PS -interacting protein genes 

20 may be expected to hybridize to the disclosed sequences under stringent hybridization 
conditions, as defined herein, whereas lower stringency may be employed to identify 
heterospecific homologues. 

In another series of embodiments, the present invention provides for 
isolated nucleic acids which include subsets of the PS-interacting protein sequences or 

25 their complements. As noted above, such sequences will have utility as probes and 
PCR primers in the identification and isolation of allelic and homologous variants of 
the PS-interacting protein genes. Subsequences corresponding to polymorphic 
regions of the PS-interacting proteins, will also have particular utility in screening 
and/or genotyping individuals for diagnostic purposes, as described below. In 

30 addition, and also as described below, such subsets will have utility for encoding (1) 
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fragments of the PS-interacting proteins for inclusion in fusion proteins. (2) fragments 
which comprise fiinctional domains of the PS-interacting proteins for use in binding 
studies, (3) fragments of the PS-interacting proteins which may be used as 
immunogens to raise antibodies against the PS-interacting proteins, and (4) fragments 
of the PS-interacting proteins which may act as competitive inhibitors or as mimetics 
of the PS-interacting proteins to inhibit or mimic their physiological functions. 
Finally, such subsets may encode or represent complementary or antisense sequences 
which can hybridize to the PS-interacting protein genes or PS-interacting protein 
mRNA transcripts under physiological conditions to inhibit the transcription or 
translation of those sequences. Therefore, depending upon the intended use. the 
present invention provides nucleic acid subsequences of the PS-interacting protein 
genes which may have lengths varying from 8-10 nucleotides (e.g., for use as PGR 
primers) to nearly the full size of the PS-interacting protein genomic or cDNAs. 
Thus, the present invention provides isolated nucleic acids comprising sequences 
corresponding to at least 8-10. preferably 15, and more preferably at least 20 
consecutive nucleotides of the PS-interacting protein genes, as disclosed or otherwise 
enabled herein, or to their complements. As noted above, however, shorter sequences 
may be useful with different technologies. 

In anotiier series of embodiments, the present invention provides nucleic 
acids in which the coding sequences for the PS-interacting proteins, with or without 
introns or recombinantly engineered as described above, are operably joined to 
endogenous or exogenous 5' and/or 3' regulatory regions. Using the present disclosure 
and standard genetic techniques (e.g., PGR extensions, targeting gene walking), one of 
ordinary skill in the art is now enabled to clone the 5' and/or 3' endogenous regulatory 
regions of any of the disclosed PS-interacting protein genes. Similarly, allelic 
variants of these endogenous regulatory regions, as well as endogenous regulatory 
regions from other mammalian homologues, are similarly enabled without undue 
experimentation. Alternatively, exogenous regulatory regions (i.e., regulatory regions 
from a different conspecific gene or a heterospecific regulatory region) may be 
operably joined to the PS-interacting protein coding sequences in order to drive 
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expression. Appropriate 5* regulatory regions will include promoter elements and 
may also include additional elements such as operator or enhancer sequences, 
ribosome binding sequences, RNA capping sequences, and the like. The regulatory 
region may be selected from sequences that control the expression of genes of 
5 prokaryotic or eukaryotic cells, their viruses, and combinations thereof Such 

regulatory regions include, but are not limited to, the lac system, the trp system, the 
tac system, and the trc system; major operator and promoter regions of phage X; the 
control region of the fd coat protein; early and late promoters of SV40; promoters 
derived from polyoma, adenovirus, retrovirus, baculovirus, and simian virus; 3- 

10 phosphoglycerate kinase promoter; yeast acid phosphatase promoters; yeast alpha- 
mating factors; promoter elements of other eukaryotic genes expressed in neurons or 
other cell types; and combinations thereof In particular, regulatory elements may be 
chosen which are inducible or repressible (e.g., the P-galactosidase promoter) to allow 
for controlled and/or manipulable expression of the PS-interacting protein genes in 

15 cells transformed with these nucleic acids. Alternatively, the PS-interacting protein 
coding regions may be operably joined with regulatory elements which provide for 
tissue specific expression in multicellular organisms. Such constructs are particularly 
useful for the production of transgenic organisms to cause expression of the PS- 
interacting protein genes only in appropriate tissues. The choice of appropriate 

20 regulatory regions is within the ability and discretion of one of ordinary skill in the art 
and the recombinant use of many such regulatory regions is now established in the art. 

In another series of embodiments, the present invention provides for 
isolated nucleic acids encoding all or a portion of the PS-interacting proteins in the 
form of a ftision protein. In these embodiments, a nucleic acid regulatory region 

25 (endogenous or exogenous) is operably joined to a first coding region which is 
covalently joined in-frame to a second coding region. The second coding region 
optionally may be covalently joined to one or more additional coding regions and the 
last coding region is joined to a termination codon and, optionally, appropriate 3' 
regulatory regions (e.g., polyadenylation signals). The PS-interacting protein 

30 sequences of the fusion protein may represent the first, second, or any additional 
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coding regions. The PS-interacting protein sequences may be conserved or non- 
conserved domains and can be placed in any codmg region of the fusion. The non- 
PS-interacting protem sequences of the fiision may be chosen according to the needs 
and discretion of the practitioner and are not limited by the present invention. Usefiil 
5 non-PS-interacting protein sequences include, for example, short sequence "tags" such 
as antigenic detemiinants or poIy-His tags which may be used to aid in the 
identification or purification of the resultant fusion protein. Alternati vely, the non- 
PS-interacting protein coding region may encode a large protein or protein fragment;.. ; 
such as an enzyme or binding protein which also may assist in the identification and 
10 purification of the protein, or which may be useful in an assay such as those described 
below. Particularly contemplated fusion proteins include poly-His and GST 
(glutathione S-transferase) fusions which are useful in isolating and purifying the 
presenilins-interacting proteins, and the yeast two hybrid fusions, described below, 
which are useful in assays to identify other proteins which bind to or interact with the 
15 PS-interacting proteins. 

In another series of embodiments, the present invention provides isolated 
nucleic acids in the form of recombinant DNA constructs in which a marker or 
reporter gene (e.g., P-galactosidase. luciferase) is operably joined to the 5' regulatory 
region of a PS-interacting protein gene such that expression of the marker gene is 

under the control of those regulatory sequences. Using the PS-interacting protein 
regulatory regions enabled herein, including regulatory regions from human and other 
mammalian species, one of ordinary skill in the art is now enabled to produce such 
constructs. As discussed more fully below, such isolated nucleic acids may be used to 
produce cells, cell lines or transgenic animals which are useful in the identification of 
compounds which can. directly or indirectly, differentially affect the expression of the 
PS-interacting proteins. 

Finally, the isolated nucleic acids of the present invention include any of 
the above described sequences when included in vectors. Appropriate vectors include 
cloning vectors and expression vectors of all types, including plasmids, phagemids, 
cosmids, episomes, and the like, as well as integration vectors. The vectors may also 



20 
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include various marker genes (e.g., antibiotic resistance or susceptibility genes) which 
are useful in identifying cells successfully transformed therewith. In addition, the 
vectors may include regulatory sequences to which the nucleic acids of the invention 
are operably joined, and/or may also include coding regions such that the nucleic 
5 acids of the invention, when appropriately ligated into the vector, are expressed as 
fusion proteins. Such vectors may also include vectors for use in yeast "two hybrid," 
baculovirus, and phage-display systems. The vectors may be chosen to be useful for 
prokaiyotic, eukaryotic or viral expression, as needed or desired for the particular 
application. For example, vaccinia virus vectors or simian virus vectors with the 

10 SV40 promoter (e.g., pS V2), or Herpes simplex virus or adeno-associated virus may 
be useful for transfection of mammalian cells including neurons in culture or in vivo. 
and the baculovirus vectors may be used in transfecting insect cells (e.g., butterfly 
cells). A great variety of different vectors are now commercially available and 
otherwise known in the art, and the choice of an appropriate vector is within the 

15 ability and discretion of one of ordinary skill in the art. 

2. Substantially Pure Proteins 

The present invention provides for substantially pure preparations of the 
PS-interacting proteins, fragments of the PS-interacting proteins, and fusion proteins 
including the PS-interacting proteins or fragments thereof The proteins, fragments 
and fusions have utility, as described herein, in the generation of antibodies to normal 
and mutant PS-interacting proteins, in the identification of proteins (aside from the 
presenilins) which bind to the PS -interacting proteins, and in diagnostic and 
therapeutic methods. Therefore, depending upon the intended use, the present 
invention provides substantially pure proteins or peptides comprising amino acid 
sequences which are subsequences of the complete PS-interacting proteins and which 
may have lengths varying from 4-10 amino acids (e.g., for use as immunogens), or 10- 
100 amino acids (e.g., for use in binding assays), to the complete PS-interacting 
proteins. Thus, the present invention provides substantially pure proteins or peptides 
comprising sequences corresponding to at least 4-5, preferably 6-10, and more 
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preferably at least 50 or 100 consecutive amino acids of the PS-interacting proteins, as 
disclosed or otherwise enabled herein. 

The proteins or peptides of the invention may be isolated and purified by 
any of a variety of methods selected on the basis of the properties revealed by their 
protein sequences. For example, the PS-interacting proteins may be isolated from 
cells in which the PS-interacting protein is noimally highly expressed. Alternatively 
the PS-interacting protein, fusion protein, or fragment thereof, may be purified from 
cells transformed or transfected with expression vectors (e.g.. baculovirus systems 
such as the pPbac and pMbac vectors (Stratagene. La Jolla, CA); yeast expression 
systems such as the pYESHIS Xpress vectors (Invitiogen, San Diego. CA); eukaryotic 
expression systems such as pcDNAS (Invitrogen. San Diego, CA) which has constant 
constitutive expression, or LacSwitch (Stratagene, La Jolla, CA) which is inducible; 
or prokaiyotic expression vectors such as pKK233-3 (Clontech. Palo Alto. CA). In 
the event that the protein or fiagment integrates into the endoplasmic reticulum or 
plasma membrane of the recombinant cells (e.g.. eukaryotic cells), the protein may be 
purified from the membrane fraction. Alternatively, if the protein aggregates in 
inclusion bodies within the recombinant cells (e.g.. prokaiyotic cells), the protein may 
be purified from whole lysed cells or from solufailized inclusion bodies. 

Purification can be achieved using standard protein purification procedures 
including, but not limited to. gel-filtration chromatography, ion-exchange 
chromatography, high-performance liquid chromatography (RP-HPLC, ion-exchange 
HPLC. size-exclusion HPLC. high-performance chromatofocusing chromatography, 
hydrophobic interaction chromatography, immunoprecipitation, or immunoaff.nity 
purification. Gel electrophoresis (e.g., PAGE, SDS-PAGE) can also be used to isolate 
a protein or peptide based on its molecular weight, charge properties and 
hydrophobicity. 

A PS-interacting protein, or a fragment thereof, may also be conveniently 
purified by creating a fiision protein including the desired PS-interacting protein 
sequence fiised to another peptide such as an antigenic determinant or poly-His, tag 
(e.g., QiAexpress vectors, QIAGEN Corp., Chatsworth, CA). or a larger protein (e.g.. 
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GST using the pGEX-27 vector (Amrad, USA) or green fluorescent protein using the 
Green Lantern vector (GIBCO/BRL. Gaithersburg, MD). The fusion protein may be 
expressed and recovered from prokaryotic or eukaryotic cells and purified by any 
standard method based upon the fusion vector sequence. For example, the fusion 
5 protein may be purified by immunoaffmity or immunoprecipitation with an antibody 
to the non-PS-interacting protein portion of the fusion or, in the case of a poly-His tag, 
by affinity binding to a nickel column. The desired PS-interacting protein or fragment 
may then be further purified from the fusion protein by enzymatic cleavage of the 
fusion protein. Methods for preparing and using such fusion constructs for the 
10 purification of proteins are well known in the an and several kits are commercially 
available for this purpose. In light of the present disclosure, one is now enabled to 
employ such fusion constructs with the PS-interacting proteins. 

3. Antibodies to the PS-interactine Proteins 

15 The present invention also provides antibodies, and methods of making 

antibodies, which selectively bind to the PS-interacting proteins or fragments thereof 
Of particular importance, by identifying the PS-interacting domains of the PS- 
interacting proteins, and methods of identifying mutant forms of the PS-interacting 
proteins associated with Alzheimer's Disease, the present invention provides 

20 antibodies, and methods of making antibodies, which will selectively bind to and, 
thereby, identify and/or distinguish normal and mutant (i.e., pathogenic) forms of the 
PS-interacting proteins. The antibodies of the invention have utility as laboratory 
reagents for, inter aha , immunoaffinity purification of the PS-interacting proteins, 
Westem blotting to identify cells or tissues expressing the PS-interacting proteins, and 

25 immunocytochemistry or inmiunofluorescence techniques to establish the subcellular 
location of the proteins. In addition, as described below, the antibodies of the 
invention may be used as diagnostics tools to identify carriers of AD-related PS- 
interacting protein alleles, or as therapeutic tools to selectively bind and inhibit 
pathogenic forms of the PS-interacting proteins in vivo . 
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The antibodies of the invention may be generated using the entire PS- 
interacting proteins of the invention, or using any PS-interacting protein epitope 
which is characteristic of that protein and which substantially distinguishes it from 
other host proteins. Any method of choosing antigenic determinants known in the art 
may, of course, be employed. Such epitopes may be identified by comparing 
sequences of, for example, 4-10 amino acid residues from a PS-interacting protein 
sequence to computer databases of protein sequences from the relevant host. In 
addition, larger fragments (e.g., 8-20 or. preferably, 9-15 residues) including one or 
more potential epitopes may also be employed. Antibodies to the PS-interacting 
domains (identified by the yeast two-hybrid assays described below) are expected to 
have the greatest utility both diagnostically and therapeutically. On the other hand, 
antibodies against highly conserved domains are expected to have the greatest utility 
for purification or identification of PS-interacting proteins. 

PS-interacting protein immunogen preparations may be produced from 
crude extracts (e.g., lysates or membrane fractions of cells highly expressing the 
proteins), from proteins or peptides substantially purified from cells which naturally 
or recombinantly express them or, for short immunogens, by chemical peptide 
synthesis. The immunogens may also be in the form of a fusion protein in which the 
non-PS-interacting protein region is chosen for its adjuvant properties. As used 
herein, a PS-interacting protein immunogen shall be defined as a preparation 
including a peptide comprising at least 4-8, and preferably at least 9-15 consecutive 
amino acid residues of a PS -interacting proteins, as disclosed or otherwise enabled 
herein. Sequences of fewer residues may, of course, also have utility depending upon 
the intended use and fixture technological developments. Therefore, any PS- 
interacting protein derived sequences which are employed to generate antibodies to 
the PS-interacting proteins should be regarded as PS-interacting protein immunogens. 

The antibodies of the invention may be polyclonal or monoclonal, or may 
be antibody fragments, including Fab fragments. F(ab%. and single chain antibody 
fragments. In addition, after identifying usefiil antibodies by the metiiod of the 
invention, recombinant antibodies may be generated, including any of tiie antibody 
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fragments listed above, as well as humanized antibodies based upon non-human 
antibodies to the PS-interacting proteins. In light of the present disclosure, as well as 
the characterization of other PS-interacting proteins enabled herein, one of ordinary 
skill in the art may produce the above-described antibodies by any of a variety of 
5 standard means well known in the art. For an overview of antibody techniques, see 
Antibody E ngineerine: A Practical Guide . Borrebaek, ed., W,H. Freeman & 
Company, NY (1992), or Antibody Engineering . 2nd Ed., Borrebaek, ed.. Oxford 
University Press, Oxford (1995). 

As a general matter, polyclonal antibodies may be generated by first 
10 immunizing a mouse, rabbit, goat or other suitable animal with the PS -interacting 
protein immunogen in a suitable carrier. To increase the immunogenicity of the 
preparation, the immimogen may be coupled to a carrier protein or mixed with an 
adjuvant (e.g., Freund's adjuvant). Booster injections, although not necessary are 
reconmiended. After an appropriate period to allow for the development of a humoral 
15 response, preferably several weeks, the animals may be bled and the sera may be 
purified to isolate the immunoglobulin component. 

Similarly, as a general matter, monoclonal anti-PS-interacting protein 
antibodies may be produced by first injecting a mouse, rabbit, goat or other suitable 
animal with a PS-interactmg protein immunogen in a suitable carrier. As above, 
20 carrier proteins or adjuvants may be utilized and booster injections (e.g., bi- or tri- 
weekly over 8-10 weeks) are recommended. After allowing for development of a 
humoral response, the animals are sacrificed and their spleens are removed and 
resuspended in, for example, phosphate buffered saline (PBS); The spleen cells serve 
as a source of lymphocytes, some of which are producing antibody of the appropriate 
25 specificity. These cells are then fiised with an immortalized cell line (e.g., myeloma), 
and the products of the ftision are plated into a number of tissue culture wells in the 
presence of a selective agent such as HAT. The wells are serially screened and 
replated, each time selecting cells making usefiil antibody. Typically, several 
screening and replating procedures are carried out until over 90% of the wells contain 
30 single clones which are positive for antibody production. Monoclonal antibodies 
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produced by such clones may be purified by standard methods such as affinity 
chromatography using Protein A Sepharose. by ion-exchange chromatography, or by 
variations and combinations of these techniques. 

The antibodies of the invemion may be labelled or conjugated with other 
5 compounds or materials for diagnostic and/or therapeutic uses. For example, they 
may be coupled to radionuclides, fluorescent compounds, or enzymes for imaging or 
therapy, or to liposomes for the targeting of compounds contained in the liposomes to 
a specific tissue location. 

10 4. Transformed Cell Lines 

The present invention also provides for cells or cell lines, both prokaiyotic 
and eukaryoUc, which have been transformed or transfected with the nucleic acids of 
the present invention so as to cause clonal propagation of those nucleic acids and/or 
expression of the proteins or peptides encoded thereby. Such cells or cell lines will 
have utility both in the propagation and production of the nucleic acids and proteins of 
the present invention but also, as further described herein, as model systems for 
diagnostic and therapeutic assays. In particular, it is expected that cells co- 
transformed with PS-interacting protein sequences as well as presenilin sequences will 
have improved utility as models of the biochemical pathways which may be affected 
in AD. For example, cells co-transfonned with the interacting domains of PS- 
interacting sequences and presenilins in yeast two-hybrid fusion constructs, will have 
utiHty in screening for compounds which either enhance or inhibit interactions 
between these domains. Similarly, for cells transformed with a heterospecific 
presenilm. co-transformation with a similarly heterospecific PS-interacting protein, or 
co-transformation and homologous recombination to introduce a similarly 
heterospecific PS-interacting domain of a PS-interacting protein (e.g.. "humanizing" a 
non-human endogenous PS-imeracting protein), will result in a better model system 
for studying the interactions of the presenilins and the PS-interacting proteins. Cells 
transfomied with only PS-interacting sequences will, of course, have utility of their 
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own for studying the role of these proteins in the etiology of AD, and also as 
precursors for presenilin co-transformed cells. 

As used herein, the term "transformed cell" is intended to embrace any 
cell, or the descendant of any cell, into which has been introduced any of the nucleic 
5 acids of the invention, whether by transfonnation, transfection, infection, or other 
means. Methods of producing appropriate vectors, transforming cells with those 
vectors, and identifying transformants are well known in the art and are only briefly 
reviewed here (see. for example, Sambrook et aL (1989) Molecular Cloning: A 
Laboratory Manual. 2nd ed.. Cold Spring Harbor Laboratory Press, Cold Spring 

10 Harbor, New York). 

Prokaryotic cells useful for producing the transformed ceils of the 
invention include members of the bacterial genera Escherichia (e.g., E. coli ). 
Pseudomonas (e.g., P. aeruginosa) , and Bacillus (e.g., B. subtillus . B. 
stearothermophilus) . as well as many others well known and frequently used in the 

15 art. Prokaryotic cells are particularly useful for the production of large quantities of 
the proteins or peptides of the invention (e.g., normal or mutant PS-interacting 
proteins, fragments of the PS-interacting proteins, fusion proteins of the PS- 
interacting proteins). Bacterial cells (e.g., E. coli) may be used with a variety of 
expression vector systems including, for example, plasmids with the T7 RNA 

20 polymerase/promoter system, bacteriophage k regulatory sequences, or Ml 3 Phage 
mGPI-2. Bacterial hosts may also be transformed with fusion protein vectors which 
create, for example, lacZ, trpE, maltose-binding protein, poly-His lags, or glutathione- 
s-transferase fusion proteins. All of these, as well as many other prokaryotic 
expression systems, are well known in the art and widely available commercially 

25 (e.g., pGEX-27 (Amrad, USA) for GST fusions). 

Eukaryotic cells and cell lines useful for producing the transformed cells of 
the invention include mammalian cells and cell lines (e.g., PC12, COS, CHO, 
fibroblasts, myelomas, neuroblastomas, hybridomas, human embryonic kidney 293, 
oocytes, embryonic stem cells), insect cells lines (e.g., using baculovirus vectors such 

30 as pPbac or pMbac (Stratagene, La Jolla, CA))^ yeast (e.g., using yeast expression 
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vectors such as pYESHIS(Invitn>gen, CA)). and fungi. Eukaiyotic cells are . 
particularly useful for embodiments in which it is necessary that the PS-interacting 
proteins, or functional fragments thereof, perfonn the fianctions and/or undergo the 
intracellular interactions associated with either the normal or mutant proteins. Thus, 
for example, transfonned eukaryotic cells are preferred for use as models of PS- 
interacting protein function or interaction, and assays for screening candidate 
therapeutics preferably employ transformed eukaryotic cells. 

To accomplish expression in eukaryotic cells, a wide variety of vectors 
have been developed and are commercially available which allow inducible (e.g.. 
LacSwitch expression vectors, Stratagene, La Jolla, CA) or cognate (e.g.. pcDNA3 
vectors, Invitrogen. Chatsworth, CA) expression of PS-interacting protein nucleotide 
sequences under the regulation of an artificial promoter element. Such promoter 
elements are often derived from CMV or SV40 viral genes, although other strong 
promoter elements which are active in eukaryotic cells can also be employed to induce 
transcription of PS-interacting protein nucleotide sequences. Typically, these vectors 
also contain an artificial polyadenylation sequence and 3' UTR which can also be 
derived from exogenous viral gene sequences or from other eukaryotic genes. 
Furthermore, in some constructs, artificial, non-coding, spliceable introns and exons 
are included in the vector to enhance expression of the nucleotide sequence of interest. 
These expression systems are commonly available from commercial sources and are 
typified by vectors such as pcDNAS and pZeoSV (Invitrogen, San Diego. CA). 
Innumerable commercially-available as well as custom-designed expression vectors 
are available from commercial sources to allow expression of any desired PS- 
interacting protein transcript in more or less any desired cell type, either constitutively 
or after exposure to a certain exogenous stimulus (e.g.. withdrawal of tetracycline or 
exposure to BPTG). 

Vectors may be introduced into the recipient or "host" cells by various 
methods well known in the art including, but not limited to, calcium phosphate 
transfection. strondum phosphate transfection, DEAE dextran transfection 
electroporation. lipofection (e.g.. Dosper Liposomal transfection reaeent, Boehringer 
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Mannheim, Germany), microinjection, ballistic insertion on micro-beads, protoplast 
fusion or, for viral or phage vectors, by infection with the recombinant virus or phage. 

5. Transgenic Animal Models 
5 The present invention also provides for the production of transgenic non- 

human animal models in which mutant or wild type PS-interacting protein sequences 
are expressed, or in which the PS-interacting protein genes have been inactivated 
(e.g., "knock-out" deletions), for the study of Alzheimer's Disease, for the screening of 
candidate pharmaceutical compounds, for the creation of explanted manunalian CNS 

10 cell cultures (e.g., neuronal, glial, organotypic or mixed cell cultures), and for the 
evaluation of potential therapeutic interventions. Prior to the present invention, a 
partial animal model for Alzheimer's Disease existed via the insertion and over- 
expression of a mutant form of the human amyloid preciu^or protein gene as a 
minigene imder the regulation of the platelet-derived growth factor p receptor 

15 promoter element (Games et al., 1995). This mutant (PAPP^,, Val->Ile) causes the 
appearance of synaptic pathology and amyloid p peptide deposition in the brain of 
transgenic animals bearing this transgene in high copy number. These changes in the 
brain of the transgenic animal are very similar to that seen in human AD (Games et 
al., 1 995). It is, however, as yet unclear whether these animals become demented, but 

20 there is general consensus that it is now possible to recreate at least some aspects of 
AD in mice. In addition, transgenic animal models in which the presenilin genes are 
genetically engineered are disclosed in PCT Publication WO96/34099. These 
transgenic animal models have been shown to have altered Ap production and altered 
hippocampus-dependent memory function. 

25 Animal species suitable for use in the animal models of the present 

invention include, but are not limited to, rats, mice, hamsters, guinea pigs, rabbits, 
dogs, cats, goats, sheep, pigs, and non-human primates (e.g.. Rhesus monkeys, 
chimpanzees). For initial studies, transgenic rodents (e.g., mice) may be preferred due 
to their relative ease of maintenance and shorter life spans. However, transgenic yeast 

30 or invertebrates (e.g., nematodes, insects) may be preferred for some studies because 
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they will allow for even more rapid and inexpensive screening. For example, 
invertebrates bearing mutant PS-interacting protein homologues (or mammalian PS- 
interacting protein transgenes) which cause a rapidly occurring and easily scored 
phenotype (e.g., abnomial vulva or eye development after several days) can be used a 
5 screens for drugs which block the effect of the mutant gene. Such invertebrates may 
prove far more rapid and efficient for mass screenings than larger vertebrate animals. 
Once lead compounds are found through such screens, they may be tested in higher 
animals such a rodents. Ultimately, transgenic non-human primates may be preferred 
for longer term studies due to their greater similarity to humans and their higher 
0 cognitive abilities. 

Using the nucleic acids disclosed and otherwise enabled herein, there are 
now several available approaches for the creation of a transgenic animal model for 
Alzheimer's Disease. Thus, the enabled animal models include: (1) Animals in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly substimted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
substitution of sequences encoding the human homologue by homologous 
recombination or gene targeting. These animals are useful for evaluating the effects 
of the transgenic procedures, and the effects of the introduction or substitution of a 
human or humanized PS-interacting protein gene. (2) Animals in which sequences 
encoding at least a functional domain of a mutant (i.e., pathogenic) human PS- 
interacting protein gene have been recombinantly introduced into the genome of the 
animal as an additional gene, under the regulation of either an exogenous or an 
endogenous promoter element, and as either a minigene or a large genomic fragment; 
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in which sequences encoding at least a functional domain of a mutant human PS- 
interacting protein gene have been recombinanily substituted for one or both copies of 
the animal's homologous PS-interacting protein gene by homologous recombination 
or gene targeting; and/or in which one or both copies of one of the animal's 
5 homologous PS-interacting protein genes have been recombinantly "humanized" by 
the partial substitution of sequences encoding a mutant human homologue by 
homologous recombination or gene targeting. These animals are useful as models 
which will display some or all of the characteristics, whether at the biochemical, 
physiological and/or behavioral level, of humans carrying one or more alleles which 
10 are pathogenic of Alzheimer's Disease or other diseases associated with mutations in 
the PS-interacting protein genes. (3) Animals in which sequences encoding at least a 
functional domain of a mutant version of one of that animal's PS-interacting protein 
genes (bearing, for example, a specific mutation corresponding to, or similar to, one 
of the pathogenic mutations of the human PS-interacting proteins) have been 
15 recombinantly introduced into the genome of the animal as an additional gene, under 
the regulation of either an exogenous or an endogenous promoter element, and as 
either a minigene or a large genomic fragment; and/or in which sequences encoding at 
least a functional domam of a mutant version of one of that animal's PS-interacting 
protein genes (bearing, for example, a specific mutation corresponding to, or similar 
20 to, one of the pathogenic mutations of the human PS-interacting proteins) have been 
recombinantly substituted for one or both copies of the animal's homologous PS- 
interacting protein gene by homologous recombination or gene targeting. These 
animals are also useful as models which will display some or all of the characteristics, 
whether at the biochemical, physiological and/or behavioral level, of humans carrying 
25 one or more alleles which are pathogenic of Alzheimer's Disease. (4) "Knock-out" 
animals in which one or both copies of one of the animal's PS-interacting protein 
genes have been partially or completely deleted by homologous recombination or 
gene targeting, or have been inactivated by the insertion or substitution by 
homologous recombination or gene targeting of exogenous sequences (e.g., stop 
30 codons, lox p sites). Such animals are useful models to study the effects which loss of 
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PS-interacting protein gene expression may have, to evaluate whether loss of ftinction 
is preferable to continued expression of mutant forms, and to examine whether other 
genes can be recruited to replace a mutant PS-interacting protein or to intervene with 
the effects of other genes (e.g., PSl, PS2. APP or ApoE) causing AD as a treatment 
for AD or other disorders. For example, a normal PS-interacting protein gene may be 
necessary for the action of mutant presenilin or APP genes to actually be expressed as 
AD and, therefore, transgenic PS-interacting protein animal models may be of use in 
elucidating such multigenic interactions. 

In addition to transgenic animal models in which the expression of one or 
more of the PS-interacting proteins is altered, the present invention also provides for 
the production of transgenic animal models in which the expression of one or more of 
the presenilins, APP, or ApoE is altered. The nucleic acids encoding the presenilins, 
APP, and ApoE are known in the art, a methods for producing transgenic animals with 
these sequences are also known (see, e.g., PCT Publication WO96/34099; Games et 
al., 1995). Indeed, because non-human animals may differ from humans not only in 
their PS-interacting protein sequences, but also in the sequences of their presenilin, 
APP and/or ApoE homologues. it is particularly contemplated that transgenics may be 
produced which bear recombinant normal or mutant human sequences for at least one 
presenilin, APP and/or ApoE gene in addition to recombinant sequences for one or 
more PS-interacting proteins. Such co-transfonned animal models would possess 
more elements of the human molecular biology and, therefore, are expected to be 
better models of human disorders. Thus, in accordance with the present invention, 
transgenic animal models may be produced bearing normal or mutant sequences for 
one or more PS-interacting proteins, or interacting domains of these proteins. These 
animals will have utility in that they can be crossed with animals bearing a variety of 
normal or mutant presenilin. APP or ApoE sequences to produce co-transfonned 
animal models. Furthermore, as detailed below, it is expected that mutations in the 
PS-interacting genes, like mutations in the presenilins themselves, may be causative 
of Alzheimer's Disease and/or other disorders as well (e.g.. other cognitive, 
intellectual, neurological or psychological disorders such as cerebral hemontage, 
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schizophrenia, depression, mental retardation and epilepsy). Therefore, transgenic 
animal models beiaring normal or mutant sequences conresponding to the PS- 
interacting proteins, absent transformation with any presenilin, APP or ApoE 
sequences, will have utility of their own in the study of such disorders. 
5 As detailed below, preferred choices for transgenic animal models 

transformed with PS-interacting proteins, or domains of PS -interacting proteins, 
include those transformed with normal or mutant sequences corresponding to the 
clones identified and described in Example 1 and disclosed in SEQ ID NOs: 1-12. 
These clones, which interact with normal or mutant PS 1 TM6->7 loop domains, were 

10 identified according to the methods described in Example 1 , below, and PCT 

Publication WO96/34099. These clones, longer nucleic acid sequences comprising 
these clones, and other clones identified according to this and other methods of the 
invention (e.g., allelic and splice variants or heterospecific homologues of these 
clones) may all be employed in accordance with the present invention to produce 

15 animal models which, with or without co-transformation with presenilin, APP and/or 
ApoE sequences, will have utility in the study of Alzheimer's Disease and/or other 
cognitive, intellectual, nem-ological or psychological disorders. 

Thus, using the nucleic acids disclosed and otherwise enabled herein, one 
of ordinary skill in the art may now produce any of the following types of transgenic 

20 animal models with altered PS-interacting protein expression: (1) Animals in which 
sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 

25 sequences encoding at least a functional domain of a normal human PS-interacting 
protein gene have been recombinantly substituted for one or both copies of the 
animals homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 

30 substitution of sequences encoding the human homologue by homologous 
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recombination or gene targeting. These animals are panicularly useful for providing 
transgenic models which express human PS-interacting proteins as ^yell as human 
presenilin proteins. They are also usefiil in evaluating the effects of the transgenic 
procedures, and the effects of the introduction or substitution of a human or 
5 humanized PS-interacting protein gene. (2) Animals in which sequences encoding at 
least a functional domain of a mutant (i.e., pathogenic) human PS-interacting protein 
gene have been recombinantly introduced into the genome of the animal as an 
additional gene, under the regulation of either an exogenous or an endogenous 
promoter element, and as either a minigene or a large genomic fragment; in which 
10 sequences encoding at least a fimctional domain of a mutant human PS-interacung 
protein gene have been recombinantly substinjted for one or both copies of the 
animal's homologous PS-interacting protein gene by homologous recombination or 
gene targeting; and/or in which one or both copies of one of the animal's homologous 
PS-interacting protein genes have been recombinantly "humanized" by the partial 
15 substitution of sequences encoding a mutant human homologue by homologous 
recombination or gene targeting. These animals are useful as models which will 
display some or all of the characteristics, whether at the biochemical, physiological 
and/or behavioral level, of humans carrying one or more alleles which are pathogenic 
of Alzheimer's Disease or other diseases associated with mutations in these PS- 
20 interacting genes. (3) Animals in which sequences encoding at least a functional 
domain of a mutant version of one of that animal's PS-interacting protein genes 
(bearing, for example, a specific mutation corresponding to, or similar to, one of the 
pathogenic mutations of the human PS-interacting proteins) have been recombinantly 
introduced into the genome of the animal as an additional gene, under the regulation 
25 of either an exogenous or an endogenous promoter element, and as either a minigene 
or a large genomic fragment; and/or in which sequences encoding at least a functional 
domain of a mutant version of one of that animal's PS-interacting protein genes 
(bearing, for example, a specific mutation corresponding to, or similar to, one of the 
pathogenic mutations of the humans PS-interacting proteins) have been recombinantly 
30 substituted for one or both copies of the animal's homologous PS-interacting protein 
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gene by homologous recombination or gene targeting. These animals are also useftil 
as models which will display some or all of the characteristics, whether at the 
biochemical, physiological and/or behavioral level, of humans carrying one or more 
alleles which are pathogenic of Alzheimer's Disease. (4) "Knock-out" animals in 
5 which one or both copies of one of the animal's PS-interacting protein genes have 
been partially or completely deleted by homologous recombination or gene targeting, 
or have been inactivated by the insertion or substitution by homologous 
recombination or gene targeting of exogenous sequences (e.g., stop codons, lox p 
sites). Such animals are useful models to study the effects which loss of PS- 

10 interacting protein gene expression may have, to evaluate whether loss of function is 
preferable to continued expression, and to examine whether other genes can be 
recruited to replace a mutant PS-interacting protein or to intervene with the effects of 
other genes (e.g., APP or ApoE) causing AD as a treatment for AD or other disorders. 
For example, a normal PS-interacting protein may be necessary for the action of 

15 mutant PSl, PS2 or APP genes to actually be expressed as AD and, therefore, 

transgenic PS-interacting protein animal models may be of use in elucidating such 
multigenic interactions. 

In some preferred embodiments, transgenic animal models are produced in 
which just the PS-interacting domains of the PS-interacting proteins are introduced 

20 into the genome of the animal by homologous recombination. Thus, for example, 
preferred embodiments include transgenic animals in which the PS-interacting 
domains of PS-interacting proteins are "humanized" by homologous recombination 
with sequences from human PS-interacting proteins. These animals may then be bred 
with transgenics in which normal or mutant presenilin sequences have been 

25 introduced. The progeny of these animals, having both human presenilin and hiiman 
PS-interacting protein sequences, will provide improved animal models for 
Alzheimer's Disease. 

To create an animal model (e.g., a transgenic mouse), a normal or mutant 
PS-interacting gene (e.g., normal or mutant S5a, GT24, p0071 , Rabl 1 , etc.), or a 

30 normal or mutant version of a recombinant nucleic acid encoding at least a functional 
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domain of a PS-interacting gene (e.g.. the PS-interacting domains obtained in the 
yeast two-hybrid system), can be inserted into a genn line or stem cell using standard 
techniques of oocyte microinjection, or transfection or microinjection into embryonic 

stem cells.. Animals produced by these or similar processes are referred to as 
5 transgenic. Similarly, if it is desired to inactivate or replace an endogenous presenilin 
or PS-interacting protein gene, homologous recombination using embryonic stem 
cells may be employed. Animals produced by these or similar processes are referred 
to as "knock-out" (inactivation) or "knock-in" (replacemem) models. 

For oocyte injection, one or more copies of the recombinant DNA 
10 constructs of the present invention may be inserted into the pronucleus of a just- 
feitilized oocyte. This oocyte is then reimplanted into a pseudo-pregnant foster 
mother. The livebom animals are screened for integrants using analysis of DNA (e.g.. 
ftom the tail veins of offspring mice) for the presence of the inserted recombinant 
transgene sequences. The transgene may be either a complete genomic sequence 
15 injected as a YAC. BAG. PAC or other chromosome DNA fragment, a cDNA with 
either the natural promoter or a heterologous promoter, or a minigene containing all of 
the coding region and other elerhents found to be necessary for optimum expression. 

Retroviral infection of early embryos can also be done to insert the 
recombinant DNA constructs of the invention. In this method, the transgene (e.g., a 
20 nomial or mutant S5a, GT24. p0071. Rab 1 1. etc., sequence) is inserted into a 

retroviral vector which is used to infect embryos (e.g.. mouse or non-human primate 
embryos) directly during the early stages of development to generate chimeras, some 
of which will lead to germline transmission. 

Homologous recombination using stem cells allows for the screening of 
gene transfer cells to identify the rare homologous recombination events. Once 
identified, these can be used to generate chimeras by injection of blastocysts, and a 
proportion of the resulting animals will show germline transmission from the 
recombinant line. This methodology is especially usefulifinactivationofa gene is 
desired. For example, inactivation of the S5a gene in mice may be accomplished by 
designing a DNA fragment which contains sequences from an S5a coding region 
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flanking a selectable marker. Homologous recombination leads to the insertion of the 
marker sequences in the middle of the coding region, causing inactivation of the S5a 
gene and/or deletion of internal sequences. DNA analysis of individual clones can 
then be used to recognize the homologous recombination events. 
5 The techniques of generating transgenic animals, as well as the techniques 

for homologous recombination or gene targeting, are now widely accepted and 
practiced. A laboratory manual on the manipulation of the mouse embryo, for 
example, is available detailing standard laboratory techniques for the production of 
transgenic mice (Hogan et al., 1986). To create a transgene, the target sequence of 
10 interest (e.g., normal or mutant presenilin sequences, normal or mutant PS -interacting 
protein sequences) are typically ligated into a cloning site located downstream of 
some promoter element which will regulate the expression of RNA from the 
sequence. Downstream of the coding sequence, there is typically an artificial 
polyadenylation sequence. In the transgenic models that have been used to 
15 successfully create animals which mimic aspects of inherited human 

neurodegenerative diseases, the most successful promoter elements have been the 
platelet-derived growth factor recqjtor p gene subunit promoter and the hamster prion 
protein gene promoter, although other promoter elements which direct expression in 
central nervous system cells would also be useful. An alternate approach to creating a 
20 transgene is to use an endogenous presenilin or PS-interacting protein gene promoter 
and regulatory sequences to drive expression of the transgene. Finally, it is possible 
to create transgenes using large genomic DNA fragments such as YACs which 
contain the entire desired gene as well as its appropriate regulatory sequences. Such 
constructs have been successfully used to drive human APP expression in transgenic 
25 mice (Lamb etal., 1993). 

Animal models can also be created by targeting the endogenous presenilin 
or PS-interacting protein gene in order to alter the endogenous sequence by 
homologous recombination. These targeting events can have the effect of removing 
endogenous sequence (knock-out) or altering the endogenous sequence to create an 
30 amino acid change associated with human disease or an otherwise abnormal sequence 
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(e.g.. a sequence which is more like the human sequence than the origmal animal 
sequence) (knock-in animal models). A large number of vectors are available to 
accomplish this and appropriate sources.of genomic DNA for mouse and other animal 
genomes to be targeted are commercially available from companies such as 
i GenomeSystems Inc. (St. Louis. Missouri. USA). The typical feature of these 
targeting vector constnicts is that 2 to 4 kb of genomic DNA is ligated 5' to a 
selectable marker (e.g.. a bacterial neomycin resistance gene under its own promoter 
element termed a "neomycin cassette"). A second DNA fragment from the gene of 
interest is then ligated downstream of the neomycin cassette but upstream of a second 
selectable marker (e.g.. thymidine kinase). The DNA fragments are chosen such that 
mutant sequences can be introduced into the germ line of the targeted animal by 
homologous replacement of the endogenous sequences by either one of the sequences 
included in the vector. Alternatively, the sequences can be chosen to cause deletion of 
sequences that would nonnally reside between the left and right amis of the vector 
sun-ounding the neomycin cassette. The former is known as a knock-in. the latter is 
known as a knock-out. Again, imiumerable model systems have been created, 
particularly for targeted knock-outs of genes including those relevant to 
neurodegenerative diseases (e.g.. targeted deletions of the murine APP gene by Zheng 
et al.. 1995; targeted deletion of the murine prion gene associated with adult onset 
human CNS degeneration by Bueler et al., 1996). 

Finally, equivalents of transgenic animals, including animals with mutated 
or inactivated presenilin genes, or mutated or inactivated PS-interacting protein genes, 
may be produced using chemical or X-ray mutagenesis of gametes, followed by 
fertilization. Using the isolated nucleic acids disclosed or othenvise enabled herein, 
one of ordinary skill may more rapidly screen the resulting offspring by, for example, 
direct sequencing RFLP. PGR, or hybridization analysis to detect mutants, or 
Southern blotting to demonstrate loss of one allele by dosage. 

6: Assays for Dmes Whirh Affr.t P S-Inten^crin,. Pmr.in .P.^....i^„ 
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In another series of embodiments, the present invention provides assays for 
identifying small molecules or other compounds which are capable of inducing or 
inhibiting the expression of the PS-interacting genes and proteins (e.g., S5a or GT24). 
The assays may be performed in vitro using non-transformed cells, immortalized cell 
5 lines, or recombinant cell lines, or in vivo using the transgenic animal models enabled 
herein. 

In particular, the assays may detect the presence of increased or decreased 
expression of S5a, GT24, p0071, Rab 1 1, or other PS-interacting genes or proteins on 
the basis of increased or decreased mRNA expression (using, e.g., the nucleic acid 

10 probes disclosed arid enabled herein), increased or decreased levels of PS-interacting 
proteins (using, e.g., the anti-PS-interacting protein antibodies disclosed and enabled 
herein), or increased or decreased levels of expression of a marker gene (e.g., P- 
galactosidase or luciferase) operably joined to a PS-interacting protein 5' regulatory 
region in a recombinant construct. 

15 Thus, for example, one may culture cells known to express a particular PS- 

interacting protein and add to the culture medium one or more test compounds. After 
allowing a sufficient period of time (e.g., 0-72 hours) for the compound to induce or 
inhibit the expression of the PS-interacting protein, any change in levels of expression 
from an established baseline may be detected using any of the techniques described 

20 above and well known in the art. In particularly preferred embodiments, the cells are 
from an immortalized cell line such as a human neuroblastoma, glioblastoma or a 
hybridoma cell line. Using the nucleic acid probes and /or antibodies disclosed and 
enabled herein, detection of changes in the expression of a PS-interacting protein, and 
thus identification of the compound as an inducer or repressor of PS-interacting 

25 protein expression, requires only routine experimentation. 

In particularly preferred embodiments, a recombinant assay is employed in 
which a reporter gene such a P-galactosidase, green fluorescent protein , alkaline 
phosphatase, or luciferase is. operably joined to the 5' regulatory regions of a PS- 
interacting protein gene. Preferred vectors include the Green Lantern 1 vector 

30 (GIBCO/BRL, Gaithersburg, MD) and the Great EScAPe pSEAP vector (Clontech, 
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Palo Alto). The PS-interacting protein regulatory regions may be easily isolated and 
cloned by one of ordinary skill in the art in light of the present disclosure of coding 
regions from these genes. The reporter gene and regulatory regions are joined in- 
frame (or in each of the three possible reading frames) so that transcription and 
5 translation of the reporter gene may proceed under the control of the PS-interacting 
protein regulatory elements. The recombinant construct may then be introduced into 
any appropriate cell type, although mammalian cells are preferred, and human cells 
are most preferred. The transformed cells may be grown in culture and, after 
establishing the baseline level of expression of the reporter gene, test compounds may 
10 be added to the medium. The ease of detection of the expression of the reporter gene 
provides for a rapid, high through-put assay for the identification of inducers and 
repressors of the PS-interacting protein gene. 

Compounds identified by this method will have potential utility in 
modifying the expression of the PS-interacting protein genes in vivo . These 
15 compounds may be further tested in the animal models disclosed and enabled herein 
to identify those compounds having the most potent in vivo effects. In addition, as 
described herein with respect to small molecules having binding activity for PS- 
mteracting proteins, these molecules may serve as "lead compounds" for the further 
development of pharmaceuticals by, for example, subjecting the compounds to 
sequential modifications, molecular modeling, and other routine procedures employed 
in rational drug design. 



20 



25 



''• Mentification of Compounds with P-S-Tntpr acting Protein Binding Tap arity 

In light of the present disclosure, one of ordinary skill in the art is enabled 
to practice new screening methodologies which will be useftil in the identification of 
proteins and other compounds which bind to, or otherwise directly interact with, the 
PS-interacting proteins. The proteins and compounds will include endogenous 
cellular components, aside from the presehilins, which interact with the PS-interacting 
proteins invivo and which, therefore, provide new targets for pharmaceutical and 
30 therapeutic interventions, as well as recombinant, synthetic and otherwise exogenous 
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compounds which may have PS-interacting protein binding capacity and, therefore, 
may be candidates for pharmaceutical agents. Thus, in one series of embodiments, 
cell lysates or tissue homogenates (e.g., human brain homogenates, lymphocyte 
lysates) may be screened for proteins or other compounds which bind to one of the 
5 normal or mutant PS-interacting proteins. Alternatively, any of a variety of 

exogenous compounds, both naturally occurring and/or synthetic (e.g., libraries of 
small molecules or peptides), may be screened for PS-interacting protein binding 
capacity. Small molecules are particular preferred in this context because they are 
more readily absorbed after oral administration, have fewer potential antigenic 

10 determinants, and/or are more likely to cross the blood brain barrier than larger 

molecules such as nucleic acids or proteins. The methods of the present invention are 
particularly useful in that they may be used to identify molecules which selectively or 
preferentially bind to a mutant form of a PS-interacting protein (rather than a normal 
form) and, therefore, may have particular utility in treating cases of AD which arise 

15 from mutations in the PS-interacting proteins. 

Once identified by the methods described above, the candidate compounds 
may then be produced in quantities sufficient for pharmaceutical administration or 
testing (e.g., ^g or mg or greater quantities), and formulated in a phaimaceutically 
acceptable carrier (see, e.g.. Remington's Pharmaceutical Sciences Gennaro, A., ed., 

20 Mack Pub., 1990). These candidate compounds may then be administered to the 

transformed cells of the invention, to the transgenic animal models of the invOTtion, to 
cell lines derived from the animal models or from human patients, or to Alzheimer's 
patients. The animal models described and enabled herein are of particular utility in 
further testing candidate compounds which bind to normal or mutant PS-interacting 

25 proteins for their therapeutic efficacy. 

In addition, once identified by the methods described above, the candidate 
compounds may also serve as "lead compounds" in the design and development of 
new pharmaceuticals. For example, as in well known in the art, sequential 
modification of small molecules (e.g., amino acid residue replacement with peptides; 

30 functional group replacement with peptide or non-peptide compounds) is a standard 
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approach in the pharmaceutical industry for the development of new phannaceuticals. 
Such development generally proceeds from a "lead compound" which is shown to 
have at least some of the activity (e.g., PS-interacting protein binding or blocking 
ability) of the desired pharmaceutical. In particular, when one or more compounds 
5 having at least some activity of interest (e.g., modulation of PS-interacting protein 
activity) are identified, structural comparison of the molecules can greatly inform the 
skilled practitioner by suggesUng portions of the lead compounds which should be 
conserved and portions which may be varied in the design of new candidate 
compounds. Thus, the present invention also provides a means of identifying lead 
10 compounds which may be sequentially modified to produce new candidate 

compounds for use in the treatment of Alzheimer's Disease. These new compounds 
then may be tested both for binding to PS-interacting proteins and/or blocking PS- 
interacting protein activity, and for therapeutic efficacy (e.g., in the animal models 
described herein). This procedure may be iterated until compounds having the desired 
15 therapeutic activity and/or efficacy are identified. 

In each of the present series of embodiments, an assay is conducted to 
detect binding between a "PS-interacting protein component" and some other moiety. 
Of particular utility will be sequential assays in which compounds are tested for the 
ability to bind to only normal or only mutant forms of the PS-interacting domains of 
10 PS-interacting proteins in the binding assays. Such compounds are expected to have 
the greatest therapeutic utilities, as described more fully belovy. The "PS-interacting 
protein component" in these assays may be a complete noonal or mutant form of a 
PS-interacting protein (e.g.. S5a, GT24, p0071. Rab 1 1, etc.) but need not be. Rather, 
particular functional domains of the PS-interacting proteins, particularly the PS- 
S interacting domains as described above, may be employed either as separate 
molecules or as pan of a fusion protein. For example, to isolate proteins or 
compounds that interact with these functional domains, screening may be carried out 

using fusion constructs and/or synthetic peptides corresponding to these regions. 
Thus, for S5a, GST-fusion peptides may be made including sequences corresponding 
approximately to amino acids 70-377 of SEQ ID NO: 2 (included in clones Y2H29 
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and Y2H31, see Example 1), approximately to amino acids 206-377 of SEQ ID NO: 2 
(which includes protein-protein interaction motifs, see Ferrell et al., 1996), or to any 
other S5a domain of interest. Similarly, for GT24, GST- or other fusion peptides may 
be produced including sequences corresponding approximately to amino acids 440- 
5 815 of SEQ ID NO: 4 (including part of the annadillo repeat segment). Obviously, 
various combinations of fiision proteins and PS-interacting protein functional domains 
are possible and these are merely examples. In addition, the functional domains may 
be altered so as to aid in the assay by. for example, introducing into the functional 
domain a reactive group or amino acid residue (e.g., cysteine) which will facilitate 

10 immobilization of the domain on a substrate (e.g., using sulfhydryl reactions). Thus, 
for example, the PS-interacting domain of S5a may be synthesized containing an 
additional C-terminal cysteine residue to facilitate immobilization of the domain. 
Such peptides may be used to create an affinity substrate for affinity chromatography 
(Suifo-Iink; Pierce) to isolate binding proteins for microsequencing. Similarly, other 

15 functional domain or antigenic fragments may be created with modified residues (see, 
e.g., Example 4). 

The proteins or other compounds identified by these methods may be 
purified and characterized by any of the standard methods known in the art. Proteins 
may, for example, be purified and separated using electrophoretic (e.g., SDS-PAGE, 

20 2D PAGE) or chromatographic (e.g., HPLC) techniques and may then be 

microsequenced; For proteins with a blocked N-terminus, cleavage (e.g., by CNBr 
and/or trypsin) of the particular binding protein is used to release peptide fragments. 
Further purification/characterization by HPLC and microsequencing and/or mass 
spectrometry by conventional methods provides internal sequence data on such 

25 blocked proteins. For non-protein compounds, standard organic chemical analysis 
techniques (e.g., IR, NMR and mass spectrometry; functional group analysis; X-ray 
crystallography) may be employed to determine their structure and identity. 

Methods for screening cellular lysates, tissue homogenates, or small 
molecule libraries for candidate PS-interaction protein-binding molecules are well 

30 known in the art and, in light of the present disclosure, may now be employed to 
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identify compounds which bind to normal or mutant PS-interacting protein 
components or which modulate PS-interacting protein activity as defined by non- 
specific measures (e.g., changes in intracellular Ca^\ GTP/GDP ratio) or by specific 
measures (e.g., changes in AP peptide production or changes in the expression of 
other downstream genes which can be monitored by differential display. 2D gel 
electrophoresis, differential hybridization, or SAGE methods). The preferred methods 
involve variations on the following techniques: ( 1 ) direct extraction by affinity 
chromatography; (2) co-isolation of PS-interacting protein components and bound 
proteins or other compounds by immunoprecipitation; (3) the Biomolecular 
Interaction Assay (BIAcore); and (4) the yeast two-hybrid systems. These and others 
are discussed separately below. 

A. Affinity Chromatograp hy 

In light of the present disclosure, a variety of affinity binding techniques 
well known in the art may be employed to isolate proteins or other compounds which 
bind to the PS-interacting protein disclosed or otherwise enabled herein. In general, a 
PS-interacting protein component may be immobilized on a substrate (e.g., a column 
or filter) and a solution including the test compound(s) is contacted with the PS- 
interacting protein, fiision or fi^gment under conditions which are permissive for 
binding. The substrate is then washed with a solution to remove unbound or weakly 
bound molecules. A second wash may then elute those compounds which strongly 
bound to the immobilized nbnnal or mutant PS-interacting protein component. 
Alternatively, the test compounds may be immobilized and a solution containing one 
or more PS-interacting protein components may be contacted with the column, filter 
or other substrate. The ability of the PS-interacting protein component to bind to the 
test compounds may be determined as above or a labeled form of the PS-interacting 
protein component (e.g., a radio-labeled or chemiluminescent fimctional domain) may 
be used to more rapidly assess binding to the substrate-immobilized compound(s). 
B. Co-ImmunoDrecip itatinn 

Another well characterized technique for the isolation of PS-interacting 
protein components and their associated proteins or other compounds is direct 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



-54- 

immunoprecipitation with antibodies. This procedure has been successfully used, for 
example, to isolate many of the synaptic vesicle associated proteins (Phizicky and 
Fields, 1994). Thus, either normal or mutant, free or membrane-bound PS-interacting 
protein components may be mixed in a solution with the candidate compound(s) 
5 under conditions which are permissive for binding, and the PS-interacting protein 
component may be immunoprecipitated. Proteins or other compounds which co- 
immunoprecipitate with the PS-interacting protein component may then be identified 
by standard techniques as described above. General techniques for 
immunoprecipitation may be found in, for example, Harlow and Lane, (1988) 
^0 Antibodies: A Laboratory Manual . Cold Spring Harbor Press, Cold Spring Harbor, 
NY. 

The antibodies employed in tliis assay, as described and enabled herein, 
may be polyclonal or monoclonal, and include the various antibody fragments (e.g.. 
Fab, F(ab')2,) as well as single chain antibodies, and the like. 

15 C. The Biomolecular Interaction Assay 

Another useful method for the detection and isolation of binding proteins 
is the Biomolecular Interaction Assay or "BL\core" system developed by Phamiacia 
Biosensor and described in the manufacturer's protocol (LKB Pharmacia, Sweden). In 
light of the present disclosure, one of ordinary skill in the art is now enabled to 

20 employ this system, or a substantial equivalent, to identify proteins or other 

compounds having PS-interacting protein binding capacity. The BlAcore system uses 
an affinity purified anti-GST antibody to immobilize GST-fusion proteins onto a 
sensor chip. Obviously, other fusion proteins and corresponding antibodies may be 
substituted. The sensor utilizes surface plasmon resonance which is an optical 

25 phenomenon that detects changes in refractive indices. A homogenate of a tissue of 
interest is passed over the immobilized fusion protein and protein-protein interactions 
are registered as changes in the refractive index. This system can be used to 
determine the kinetics of binding and to assess whether any observed binding is of 
physiological relevance. 
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D. The Yeast T wo.HvhriH Sy gt^m 

The yeast "two-hybrid" system takes advantage of transcriptional factors 
that are composed of two physically separable, functional domains (Phizicky and 
Fields, 1 994). The most commonly used is the yeast GAL4 transcriptional activator 
i consisting of a DNA binding domain and a transcriptional activation domain. Two 
different cloning vectors are used to generate separate fusions of the GAL4 domains 
to genes encoding potential binding proteins. The fusion proteins are co-expressed, 
targeted to the nucleus and, if interactions occur, activation of a reporter gene (e.g.,' 
lacZ) produces a detectable phenotype. For example, the Clontech Matchmaker 
System-2 may be used with the Clontech brain cDNA GAL4 activation domain fusion 
library with PS-interacting protein-GAL4 binding domain fusion clones (Clontech. 
Palo Alto, CA). In light of the disclosures herein, one of ordinary skill in the art is 

now enabled to produce a variety of PS-interacting protein fusions, including fusions 

including either normal or mutant functional domains of the PS-interacting proteins. 

and to screen such fusion libraries in order to identify PS-interacting protein binding 

proteins. 

E. Other Methods 

The nucleotide sequences and protein products, including both mutant and 
npmial fomis of these nucleic acids and their corresponding proteins, can be used with 
the above techniques to isolate other interacting proteins, and to identify other genes 
whose expression is altered by the over-expression of normal PS-interacting protein 
sequences, by the under-expression of nomial PS-interacting pmtein sequences, or by 
the expression of mutant PS-interacting protein sequences. Identification of these 
other interacting proteins, as well as the identification of other genes whose 
expression levels are altered in AD will identify other gene targets which have direct 
relevance to the pathogenesis of this disease in its clinical or pathological forms. 
Specifically, other genes will be identified which may themselves be the site of other 
mutations causing Alzheimer's Disease, or which can themselves be targeted 
therapeutically (e.g.. to reduce their expression levels to normal, or to 
phamaacologically block the effects of their over-expression) as a potential treatment 
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for this disease. Specifically, these techniques rely on PCR-based and/or 
hybridization-based methods to identify genes which are differentially expressed 
between two conditions (a cell line expressing normal PS-interacting proteins 
compared to the same cell type expressing a mutant PS-interacting protein). These 
5 techniques include differential display, serial analysis of gene expression (SAGE), and 
mass-spectrometry of protein 2D-gels and subtractive hybridization (reviewed in 
Nowak, 1 995 and Kahn, 1 995). ^ 

As will be obvious to one of ordinary skill in the art, there are numerous 
other methods of screening individual proteins or other compounds, as well as large 

10 libraries of proteins or other compounds (e.g., phage display libraries and cloning 

systems from Stratagene, La Jolla, CA) to identify molecules which bind to normal or 
mutant PS-interactirig protein components. All of these methods comprise the step of 
mixing a normal or mutant PS-interacting protein, fusion, or fragment with test 
compounds, allowing for binding (if any), and assaying for bound complexes. All 

15 such methods are now enabled by the present disclosure of substantially pure PS- 
interacting proteins, substantially pure PS-interacting functional domain fragments, 
PS-interacting protein fusion proteins, PS-interacting protein antibodies, and methods 
of making and using the same. 

20 8. Disrupting PS*Interacting Protein Interactions 

The ability to disrupt specific interactions of the PS-interacting proteins 
with the presenilins, or with other proteins, is potentially of great therapeutic value, 
and will be important in understanding the etiology of AD and in identifying 
additional targets for therapy. The methods used to identify compounds which disrupt 

25 PS-interacting protein interactions may be applied equally well to interactions 
involving either normal or mutant PS-interacting proteins. 

Assays for compounds which can dismpt PS-interacting protein 
interactions may be performed by any of a variety of methods well known in the art. 
In essence, such assays will parallel those assays for identifying proteins and 

30 compounds with binding activity toward the PS-interacting proteins. Thus, once a 
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compound with binding activity for a PS-interacting protein is identified by any 
method, that method or an equivalent method may be performed in the presence of 
candidate compounds to identify compounds which disrupt the interaction. Thus, for 
example, the assay may employ methods including (1) affinity chromatography; (2) 
immunoprecipitation; (3) the Biomolecular Interaction Assay (BIAcor^); or (4ithe 
yeast two-hybrid systems. Such assays can be developed using either nonnal or 
mutant purified PS-interacting proteins, and/or either normal or mutant purified 
binding proteins (e.g., normal or mutant presenilins). 

For affinity methods, either the PS-interacting protein or its binding 
partner may be affixed to a matrix, for example in a column, and the counterpart 
protein (e.g., the PS-interacting protein if presenilin or another binding partner is 
affixed to the matrix; or a presenilin or other binding partner if the PS-interacting 
protein is affixed to the matrix) is then exposed to the affixed protein/compound either 
before or after adding the candidate compound(s). In the absence of a disruptive 
effect by the candidate compound(s), the interaction between the PS-inteiacting 
protein and its binding partner will cause the counterpart protein to bind to the affixed 
protein. Any compound which disrupts the interaction will cause release of the 
counterpart protein from the matrix. Release of the counterpart protein from the 
matrix can be measured using methods known in the art. 

For PS-interacting protein interactions which are detectable by yeast two- 
hybrid systems, these assays may also be employed to identify compounds which 
disrupt the interaction. Briefly, a PS-interacting protein and its binding partner (or 
appropriate structural domains of each) are employed in the fusion proteins of the 
system, and the cells are exposed to candidate compounds to detemiine their effect 
upon the expression of die reporter gene. By appropriate choice of a reporter gene, 
such a system can be readily adapted for high through-put screening of large libraries 
of compounds by, for example, using a reporter gene which confers resistance to an 
antibiotic which is present in the medium, or which rescues an auxotrophic strain 
grown in minimal medium. 
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These assays may be used to screen many different types of compounds for 
their disruptive effect on the interactions of the PS-interacling proteins. For example, 
the compounds may belong to a library of synthetic molecules, or be specifically 
designed to disrupt the interaction. The compounds may also be peptides 
5 corresponding to the interacting domain of either protein. This type of assay can be 
used to identify compounds that disrupt a specific interaction between a given PS- 
interacting protein variant and a given binding partner. In addition, compounds that 
disrupt all interactions with PS-interacting proteins may be identified, For example, a 
compound that specifically disrupts the folding of PS-interacting proteins would be 
10 expected to disrupt all interactions between PS-interacting proteins and other proteins. 
Alternatively, this type of disruption assay can be used to identify compounds which 
disrupt only a range of different PS-interacting protein interactions, or only a single 
PS-interacting protein interaction. 

15 9. Methods of Identifvine Compounds Modulating PS-Interacting Protein Activity 
In another series of embodiments, the present invention provides for 
methods of identifying compounds with the ability to modulate the activity of normal 
and mutant PS-interacting proteins. As used with respect to this series of 
embodiments, the term "activity" broadly includes gene and protein expression, PS- 

20 interacting protein post-translation processing, trafficking and localization, and any 
functional activity (e.g., enzymatic, receptor-effector, binding, channel), as well as 
downstream affects of any of these. It is known that Alzheimer's Disease is associated 
with increased production of the long form of AP peptides, the appearance of amyloid 
plaques and neurofibrillary tangles, decreases in cognitive function, and apoptotic cell 

25 death. Therefore, using the transformed cells and transgenic animal models of the 
present invention, cells obtained from subjectis bearing normal or mutant PS- 
interacting protein genes, or animals or human subjects bearing naturally occurring 
normal or mutant PS-interacting proteins, it is now possible to screen candidate 
pharmaceuticals and treatments for their therapeutic effects by detecting changes in 
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one or more of these functional characteristics or phenotypic manifestations of normal 
or mutant PS-interacting protein expression. 

Thus, the present invention provides methods for screening or assaying for 
proteins, small molecules or other compounds which modulate PS-interacting protein 
activity by contacting a cell in vivo or in vitro with a candidate compound and 
assaying for a change in a marker associated with nonnal or mutant PS-interacting 
protein activity. The marker associated with PS-interacting protein activity may be 
any measurable biochemical, physiological, histological and/or behavioral 
characteristic associated with PS-interacting protein expression. In particular, usefiil 
markers will include any measurable biochemical, physiological, histological and/or 
behavioral characteristic which distinguishes cells, tissues, animals or individuals 
bearing at least one mutant presenilin or PS-interacting protein gene from their normal 
counterparts. In addition, the marker may be any specific or non-specific measure of 
presenilin or PS-interacting protein activity. PS-interacting protein specific measures 
include measures of PS-interacting protein expression (e.g., PS-interacting protein 
mRNA or protein levels) which may employ the nucleic acid probes or antibodies of 
the present invention. Non-specific measures include changes in cell physiology such 
as pH, intracellular calcium, cyclic AMP levels, GTP/GDP ratios, 
phosphatidylinositol activity, protein phosphorylation, etc., which can be monitored 
on devices such as the cytosensor microphysiometer (Molecular Devices Inc., United 
States). The activation or inhibition of PS-interacting protein activity in its mutant or 
nonnal form can also be monitored by examining changes in the expression of other 
genes (e.g., the presenilins) which are specific to the PS-interacting protein pathway 
leading to Alzheimer's Disease. These can be assayed by such techniques as 
differential display, differential hybridization, and SAGE (sequential analysis of gene 
expression), as well as by two dimensional gel electrophoresis of cellular lysates. In 
each case, the differentially-expressed genes can be ascertained by inspection of 
identical studies before and after application of the candidate compound. 
Furthermore, as noted elsewhere, the particular genes whose expression is modulated 
by the administration of the candidate compound can be ascertained by cloning. 
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nucleotide sequencing, amino acid sequencing, or mass spectrometry (reviewed in 
Nowak, 1995). 

In general, a cell may be contacted with a candidate compound and, after 
an appropriate period (e.g., 0-72 hours for most biochemical measures of cultured 
5 cells), the marker of presenilin or PS-interacting protein activity may be assayed and 
compared to a baseline measurement. The baseline measurement may be made prior 
to contacting the cell with the candidate compound or may be an external baseline 
established by other experiments or known in the art. The cell may be a transformed 
cell of the present invention or an explant from an animal or individual. In particular, 

10 the cell may be an explant from a carrier of a presenilin or PS-interacting protein 

mutation (e.g., a human subject with Alzheimer's Disease) or an animal model of the 
invention (e.g., a transgenic nematode or mouse bearing a mutant presenilin or PS- 
interacting protein gene). To augment the effect of presenilin or PS-interacting 
protein mutations on the Ap pathway, transgenic cells or animals may be employed 

15 which have increased Ap production. Preferred cells include those from neurological 
tissues such as neuronal, glial or mixed cell cultures; and cultiired fibroblasts, liver, 
kidney, spleen, or bone marrow. The cells may be contacted with the candidate 
compounds in a culture in vitro or may be administered in vivo to a live animal or 
human subject. For live animals or human subjects, the test compound may be 

20 administered orally or by any parenteral route suitable to the compound. For clinical 
trials of human subjects, measurements may be conducted periodically (e.g., daily, 
weekly or monthly) for several months or years. 

Because most individuals bearing a mutation in a particular gene are 
heterozygous at that locus (i.e., bearing one noraial and one mutant allele), 

25 compounds may be tested for their ability to modulate normal as well as mutant 
presenilin or PS-interacting protein activity. Thus, for example, compounds which 
enhance the function of normal presenilins or PS-interacting proteins may have utility 
in treating Alzheimer's Disease or related disorders. Alternatively, because 
suppression of the activity of both normal and mutant copies of a gene in a 

30 heterozygous individual may have less severe clinical consequences than progression 
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of the associated disease, it may be desired to identify compound which inactivate or 
suppress all forms of the presenilins, the PS-interacting proteins, or their interactions. 
Preferably, however, compounds are identified which selectively or specifically 
inactivate or suppress the activity of mutant presenilin or PS-interacting proteins 
without disrupting the function of their normal counteiparts. 

In light of the identification, characterization, and disclosure herein of a 
novel group of PS-interacting genes and proteins, the PS-interacting protein nucleic, 
acid probes and antibodies, and the PS-interacting protein transformed cells and 
transgenic animals of the invention, one of ordinary skill in the art is now enabled by 
perform a great variety of assays which will detect the modulation of presenilin and/or 
PS-interacting protein activity by candidate compounds. Particularly preferred and 
contemplated embodiments are discussed in some detail below. 

A. PS-Interact ine Protein Expression 

In one series of embodiments, specific measures of PS-interacting protein 
expression are employed to screen candidate compounds for their ability to affect 
presenilin activity. Thus, using the PS-interacting protein nucleic acids and antibodies 
disclosed and otherwise enabled herein, one may use mRNA levels or protein levels 
as a marker for the ability of a candidate compound to modulate PS-interacting 
protein activity. The use of such probes and antibodies to measure gene and protein 
expression is well known in the art and discussed elsewhere herein. Of particular 
interest may be the identification of compounds which can alter the relative levels of 
different variants (e.g., mutant and nonnal) of the PS-interacting proteins. 
B. Intracellular Localization 

In another series of embodiments, compounds may be screened for their 
ability to modulate the activity of the PS-interacting proteins based upon their effects 
on the trafficking and intracellular localization of the PS-interacting proteins. The 
presenilins and some of the PS-interacting proteins (e.g.. S5a) have been seen 
immunocytochemically to be localized in membrane structures associated with the 
endoplasmic reticulum and Golgi apparatus. Differences in localization of mutant and 
nonnal presenilins or PS-interacting proteins may, therefore, contribute to the etiology 
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of Alzheimer's Disease and related disorders. Compounds which can affect the 
localization of these proteins may, therefore, be identified as potential therapeutics. 
Standard techniques known in the art may be employed to detect the localization of 
the presenilins and PS-interacting proteins. Generally, these techniques will employ 
5 the antibodies of the present invention, and in particular antibodies which selectively 
bind to one or more mutant PS-interacting proteins but not to normal proteins. As is 
well known in the art, such antibodies may be labeled by any of a variety of 
techniques (e.g., fluorescent or radioactive tags, labeled secondary antibodies, avidin- 
biotin, etc.) to aid in visualizing the intracellular location of these proteins. The PS- 

10 interacting proteins may be co-localized to particular structures, as in known in the 
art, using antibodies to markers of those structures (e.g., TGN38 for the Golgi, 
transferrin receptor for post-Golgi transport vesicles, LAMP2 for lysosomes). 
Western blots of purified fractions from cell lysates enriched for different intracellular 
membrane bound organelles (e.g., lysosomes, synaptosomes, Golgi) may also be 

15 employed. 

B. Ion Reeulation/Metabolism 

In another series of embodiments, compounds may be screened for their 
ability to modulate the activity of the presenilins or PS-interacting proteins based 
upon measures in intracellular Ca^\ Na* or levels or metabolism. As noted above, 

20 the presenilins are membrane associated proteins which may serve as, or interact with, 
ion receptors or ion channels. Thus, compounds may be screened for their ability to 
modulate presenilin and PS-interacting protein-related metabolism of calcium or other 
ions either in vivo or in vitro by, for example, measurements of ion channel fluxes 
and/or transmembrane voltage and/or current fluxes, using patch clamps, voltage 

25 clamps or fluorescent dyes sensitive to intracellular ion levels or transmembrane 
voltage. Ion charmel or receptor function can also be assayed by measurements of 
activation of second messengers such as cyclic AMP, cGMP tyrosine kinases, 
phosphates, increases in intracellular Ca^* levels, etc. Recombinantly made proteins 
may also be reconstructed in artificial membrane systems to study ion charmel 

30 conductance and, therefore, the "cell" employed in such assays may comprise an 
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artificial membrane or cell. Assays for changes in ion regulation or metabolism can 
be performed on cultured cells expressing endogenous normal or mutant presenilins 
and PS-interacting proteins. Such smdies also can be performed on cells transfected 
with vectors capable of expressing one of the presenilins or PS-interacting proteins, or 
functional domains of one of the presenilins or PS-interacting proteins, in normal or 
mutant form. In addition, to enhance the signal measured in such assays, cells may be 
co-transfected with genes encoding ion channel proteins. For example. Xenopus 
oocytes or rat kidney (HEK293) cells may be co-transfected with sequences encoding 
rat brain Na* pi subunits, rabbit skeletal muscle C^'* (Jl subunits. or rat heart pi 
subunits. Changes in presenilin or PS-interacting protein-mediated ion channel 
activity can be measured by. for example, two-microelectrode voltage-clamp 
recordings in oocytes, by whole-cell patch-clamp recordings in HEK293 cells, or by 
equivalent means. 

C. Apoptosi.'; n r Cell Deafh 

In another series of embodiments, compounds may be screened for their 
ability to modulate the activity of the presenilins or PS-interacting proteins based 
upon their effects on presenilin or PS-interacting protein-related apoptosis or cell 
death. Thus, for example, baseline rates of apoptosis or cell death may be established 
for cells in culture, or the baseline degree of neuronal loss at a particular age may be 
established post-mortem for animal models or human subjects, and the ability of a 
candidate compound to suppress or inhibit apoptosis or cell death may be measured. 
Cell death may be measured by standard microscopic techniques (e.g., light 
microscopy) or apoptosis may be measured more specifically by characteristic nuclear 
morphologies or DNA fragmentation patterns which create nucleosomal ladders (see, 
e.g., Gavrieli et al.. 1992; Jacobson et al., 1993; Vito et al., 1996). TUNEL may also 
be employed to evaluate cell death in brain (see, e.g., Lassmami et al.. 1995). In 
preferred embodiments, compounds are screened for their ability to suppress or inhibit 
neuronal loss in the transgenic animal models of the invention. Transgenic mice 
bearing, for example, a mutant human, mutant mouse, or humanized mutant presenilin 
or PS-interacting protein gene may be employed to identify or evaluate compounds 
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which may delay or arrest the neurodegeneration associated with Alzheimer's 
Disease. A similar transgenic mouse model, bearing a mutant APP gene, has recently 
been reported by Games et al. (1995). 

D. AB Peptide Production 
5 In another series of embodiments, compounds may be screened for their 

ability to modulate presenilin or PS-interacting protein-related changes in APP 
processing. The Ap peptide is produced in several isoforms resulting from differences 
in APP processing. The Ap peptide is a 39 to 43 amino acid derivative of PAPP 
which is progressively deposited in diffuse and senile plaques and in blood vessels of 

10 subjects with AD. In human brain, Ap peptides are heterogeneous at both the N- and 
C-termini. Several observations, however, suggest that both the foil length and N- 
terminal truncated forms of the long-tailed AP peptides ending at residue 42 or 43 
(i.e., Apl-42/43 and Apx-42/43) have a more important role in AD than do peptides 
ending at residue 40. Thus, Apl -42/43 and APx-42/43 are an early and prominent 

15 feature of both senile plaques and diffuse plaques, while peptides ending at residue 40 
(i.e., Api-40 and APx-40) are predominantly associated with a subset of mature 
plaques and with amyloidotic blood vessels (see, e.g., Iwatsubo et al., 1995; Gravina 
et al., 1995; Tamaoka et al., 1995; Podlisny et al. 1995). Furthermore, the long-tailed 
isoforms have a greater propensity to fibril formation, and are thought to be more 

20 neurotoxic than Apl-40 peptides (Pike et al., 1993; Hilbich et al„ 1991). Finally, 
missense mutations at codon 717 of the PAPP gene are associated with early onset 
FAD, and result in overproduction of long-tailed Ap in the brain of affected mutation 
carriers, in peripheral cells and plasma of both affected and presymptomatic carriers, 
and in cell lines transfected with pAPP,,, mutant cDNAs (Tamaoka et al., 1994; 

25 Suzuki etal., 1994). 

Thus, in one series of embodiments, the present invention provides 
methods for screening candidate compounds for their ability to block or inhibit the 
increased production of long isoforms of the Ap peptides in cells or transgenic 
animals expressing a normal or mutant presenilin gene and/or a normal or mutant PS- 

30 interacting protein gene. In particular, the present invention provides such methods in 
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which cultured manunalian cells, such as brain cells or fibroblasts, have been 
transformed according to the methods disclosed herein, or in which transgenic 
animals, such as rodents or non-human primates, have been produced by the methods 
disclosed herein, to express relatively high levels of a normal or mutant presenilin or 
5 PS-interacting protein. Optionally, such cells or transgenic animals may also be 
transformed so as to express a normal or mutant form of the p APP protein at 
relatively high levels. 

In this series of embodiments, the candidate compound is administered to 
the cell line or transgenic animals (e.g.. by addition to the media of cells in culture; or 

10 by oral or parenteral administration to an animal) and, after an appropriate period 
(e.g., 0-72 hours for cells in culture, days or months for animal models), a biological 
sample is collected (e.g., cell culture supernatant or cell lysate from cells in culture; 
tissue homogenate or plasma from an animal) and tested for the level of the long 
isoforms of the AP peptides. The levels of the peptides may be determined in an 

15 absolute sense (e.g.. nMol/ml) or in a relative sense (e.g., ratio of long to short Ap 
isoforms). The Ap isoforms may be detected by any means known in the art (e.g., 
electrophoretic separation and sequencing) but, preferably, antibodies which are 
specific to the long isoform are employed to determine the absolute or relative levels 
of the Apl.42/43 or APx.42/43 peptides. Candidate phannaceuticals or therapies 

20 which reduce the absolute or relative levels of these long AP isofomis. particularly in 
the transgenic animal models of the invention, are likely to have therapeutic utility in 

the treatment of Alzheimer's Disease, or other disorders caused by mutations in the 
presenilins or PS-interacting proteins, or by other aberrations in APP metabolism. 
Phosphorylation of Micr otubule Associated Proteins 

25 In another series of embodiments, candidate compounds may be screened 

for their ability to modulate presenilin or PS-interacting protein activity by assessing 
the effect of the compound on levels of phosphorylation of microtubule associated 
proteins (MAPs) such as tau. The abnormal phosphorylation of tau and other MAPs 
in the brains of victims of Alzheimer's Disease is well known in the art. Thus, 

30 compounds which prevent or inhibit the abnomial phosphorylation of MAPs may 
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have utility in treating presenilin or PS-interacting protein-associated diseases such as 
AD. As above, cells from normal or mutant animals or subjects, or the transformed 
cell lines and animal models of the invention may be employed. Preferred assays will 
employ cell lines or animal models transformed with a mutant human or humanized 
5 mutant presenilin or PS -interacting protein gene. The baseline phosphorylation state 
of MAPs in these cells may be established and then candidate compounds may be 
tested for their ability to prevent, inhibit or counteract the hyperphosphorylation 
associated with mutants. The phosphorylation state of the MAPs may be determined 
by any standard method known in the art but, preferably, antibodies which bind 
10 selectively to phosphorylated or unphosphorylated epitopes are employed. Such 
antibodies to phosphorylation epitopes of the tau protein are known in the art (e.g., 
ALZ50), 

10. Screening and Diagnostics for Alzheimer's Disease 

15 A. General Diagnostic Methods 

The PS-interacting genes and gene products, as well as the PS-interacting 
protein derived probes, primers and antibodies, disclosed or otherwise enabled herein, 
are useful in the screening for carriers of alleles associated with Alzheimer's Disease, 
for diagnosis of victims of Alzheimer's Disease, and for the screening and diagnosis of 

20 related presenile and senile dementias, psychiatric diseases such as schizophrenia and 
depression, and neurologic diseases such as stroke and cerebral hemorrhage, all of 
which are seen to a greater or lesser extent in symptomatic human subjects bearing 
mutations in the PSl or PS2 genes or in the APP gene. Individuals at risk for 
Alzheimer's Disease, such as those with AD present in the family pedigree, or 

25 individuals not previously known to be at risk, may be routinely screened using 

probes to detect the presence of a mutant PS-interacting protein gene or protein by a 
variety of techniques. Diagnosis of inherited cases of these diseases can be 
accomplished by methods based upon the nucleic acids (including genomic and 
mRNA/cDNA sequences), proteins, and/or antibodies disclosed and enabled herein, 

30 including functional assays designed to detect failure or augmentation of the normal 
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presenilin or PS-interacting protein activity and/or the presence of specific new 
activities conferred by mutant PS-interacting proteins. Preferably, the methods and 
products are based upon the human nucleic acids, proteins or antibodies, as disclosed 
or othenvise enabled herein. As will be obvious to one of ordinary skill in the art, 
however, the significant evolutionary conservation of large portions of nucleotide and 
amino acid sequences, even in species as diverse as humans, mice. C. elegans . and 
Drosophila . allow the skilled artisan to make use of non-human homoiogues of the 
PS-interacting proteins to produce useful nucleic acids, proteins and antibodies, even 
for applications directed toward human or other animal subjects. Thus, for brevity of 
exposition, but without limiting the scope of the invention, the following description 
will focus upon uses of the human homoiogues of PS-interacting proteins and genes. 
It will be understood, however, that homologous sequences from other species will be 
equivalent for many purposes. 

As will be appreciated by one of ordinary skill in the art, the choice of 
diagnostic methods of the present invention will be influenced by the nature of the 
available biological samples to be tested and the nature of the information required. 
Alzheimer's Disease is. of course, primarily a disease of the brain, but brain biopsies 
are invasive and expensive procedures, particularly for routine screening. Other 
tissues which express the presenilins or PS-interacting proteins at significant levels 
20 may, therefore, be preferred as sources for samples. 

B. Protein B ased Screens and Diap nnstirs 

When a diagnostic assay is to be based upon PS-interacting proteins, a variety 
of approaches are possible. For example, diagnosis can be achieved by monitoring 
differences in the electrophoretic mobility of normal and mutant proteins. Such an 
25 approach will be particularly useful in identifying mutants in which charge 
substitutions are present, or in which insertions, deletions or substitutions have 
resulted in a significant change in the electrophoretic migration of the resultant 
protein. Alternatively, diagnosis may be based upon differences in the proteolytic 
cleavage patterns of normal and mutant proteins, differences in molar ratios of the 
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various amino acid residues, or by functional assays demonstrating altered function of 
the gene products. 

In preferred embodiments, protein-based diagnostics will employ differences 
in the ability of antibodies to bind to normal and mutant PS-interacting proteins. Such 
5 diagnostic tests may employ antibodies which bind to the nomial proteins but not to 
mutant proteins, or vice versa. In particular, an assay in which a plurality of 
monoclonal antibodies, each capable of binding to a mutant epitope, may be 
employed. The levels of anti -mutant antibody binding in a sample obtained from a 
test subject (visualized by, for example, radiolabelling, ELISA or chemiluminescence) 

10 may be compared to the levels of binding to a control sample. Alternatively, 
antibodies which bind to normal but not mutant proteins may be employed, and 
decreases in the level of antibody binding may be used to distinguish homozygous 
normal individuals from mutant heterozygotes or homozygotes. Such antibody 
diagnostics may be used for in situ immimohistochemistry using biopsy samples of 

15 CNS tissues obtained antemortem or postmortem, including neuropathological 

structures associated with these diseases such as neurofibrillary tangles and amyloid 
plaques, or may be used with fluid samples such a cerebrospinal fluid or with 
peripheral tissues such as white blood cells. 

C. Nucleic Acid Based Screens and Diagnostics 

20 When the diagnostic assay is to be based upon nucleic acids from a sample, 

the assay may be based upon mRNA, cDNA or genomic DNA. When mRNA is used 
from a sample, there are considerations with respect to source tissues and the 
possibility of alternative splicing. That is, there may be little or no expression of 
transcripts imless appropriate tissue sources are chosen or available, and alternative 

25 splicing may result in the loss of some infomiation or difficulty in interpretation. 

Whether mRNA, cDNA or genomic DNA is assayed, standard methods well known in 
the art may be used to detect the presence of a particular sequence either in situ or in 
vitro (see, e.g., Sambrook et al., (1989) Molecular Cloning: A Laboratory Manual , 
2nd ed.. Cold Spring Harbor Press, Cold Spring Harbor, NY). As a general matter, 

30 however, any tissue with nucleated cells may be examined. 
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Genomic DNA used for the diagnosis may be obtained from body cells, such 
as those present in the blood, tissue biopsy, surgical specimen, or autopsy material. 
The DNA may be isolated and used directly for detection of a specific sequence or 
may be amplified by the polymerase chain reaction (PGR) prior to analysis. 
5 Similarly, RNA or cDNA may also be used, with or without PGR amplification. To 
detect a specific nucleic acid sequence, direct nucleotide sequencing, hybridization 
using specific oligonucleotides, restriction enzyme digest and mapping, PGR 
mapping, RNase protection, chemical mismatch cleavage, ligase-mediated detection, 
and various other methods may be employed. Oligonucleotides specific to particular 
10 sequences can be chemically synthesized and labeled radioactively or non- 

radioactively (e.g., biotin tags, ethidium bromide), and hybridized to individual 
samples immobilized on membranes or other solid-supports (e.g.. by dot-blot or 
transfer from gels after electrophoresis), or in solution. The presence or absence of 
the target sequences may then be visualized using methods such as autoradiography, 
15 fluorometry, or colorimetry. These procedures can be automated using redundant, 
short oligonucleotides of known sequence fixed in high density to silicon chips. 
(1) Appropriate Probes and Primers 
Whether for hybridization, RNase protection, ligase-mediated detection, PGR 
amplification or any other standards methods described herein and well known in the 
20 art, a variety of subsequences of the PS-interacting protein sequences disclosed or 
otherwise enabled herein will be useful as probes and/or primers. These sequences or 
subsequences will include both nomial sequences and deleterious mutant sequences. 
In general, usefiil sequences will include at least 8-9, more preferably 10-50, and most 
preferably 18-24 consecutive nucleotides from introns, exons or intron/exon 
25 boundaries. Depending upon the target sequence, the specificity required, and future 
technological developments, shorter sequences may also have utility. Therefore, any 
PS-interacting protein derived sequence which is employed to isolate, clone, amplify, 
identify or otherwise manipulate a PS-interacting protein sequence may be regarded as 
an appropriate probe or primer. Particularly contemplated as usefiil will be sequences 
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including nucleotide positions from the PS-interacting protein genes in which disease- 
causing mutations are known to be present, or sequences which flank these positions. 
(2) Hybridization Screening 
For insitu detection of a nonnal or mutant PS-interacting protein-related 
nucleic acid sequence, a sample of tissue may be prepared by standard techniques and 
then contacted with one or more of the above-described probes, preferably one which 
is labeled to facilitate detection, and an assay for nucleic acid hybridization is 
conducted under stringent conditions which permit hybridization only between the 
probe and highly or perfectly complementary sequences. Because many mutations 
consist of a single nucleotide substitution, high stringency hybridization conditions 
may be required to distinguish normal sequences from most mutant sequences. When 
the PS-interacting protein genotypes of the subject's parents are known, probes may 
be chosen accordingly. Alternatively, probes to a variety of mutants may be 
employed sequentially or in combination. Because most individuals carrying 
15 mutations in the PS-interacting proteins will be heterozygous, probes to normal 
sequences also may be employed and homozygous normal individuals may be 
distinguished from mutant heterozygotes by the amount of binding (e.g., by intensity 
of radioactive signal). In another variation, competitive binding assays may be 
employed in which both normal and mutant probes are used but only one is labeled. 
20 (3) Restriction Mapping 

Sequence alterations may also create or destroy fortuitous restriction enzyme 
recognition sites which are revealed by the use of appropriate enzyme digestion 
followed by gel-blot hybridization. DNA fragments carrying the site (normal or 
mutant) are detected by their increase or reduction in size, or by the increase or 
25 decrease of corresponding restriction fragment numbers. Such restriction fragment 
length polymorphism analysis (RFLP), or restriction mapping, may be employed with 
genomic DNA, mRNA or cDNA. The PS-interacting protein sequences may be 
amplified by PGR using the above-described primers prior to restriction, in which 
case the lengths of the PGR products may indicate the presence or absence of 
30 particular restriction sites, and/or may be subjected to restriction after amplification. 
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The restriction fragments may be visuahzed by any convenient means (e.g., under UV 
light in the presence of ethidium bromide). 
(4) PGR Maop inp 

In another series of embodiments, a single base substitution mutation may be 
5 detected based on differential PGR product length or production in PGR. Thus, 
primers which span mutant sites or which, preferably/have 3' temiini at mutation 
sites, may be employed to amplify a sample of genomic DNA, mRNA or cDNA from 
a subject. A mismatch at a mutational site may be expected to alter the ability of the 
nonnal or mutant primers to promote the polymerase reaction and, thereby, result in 
10 product profiles which differ between nomial subjects and heterozygous and/or 
homozygous mutants. The PGR products of the nonnal and mutant gene may be 
differentially separated and detected by standard techniques, such as polyacrylamide 
or agarose gel electrophoresis and visualization with labeled probes, ethidium 
bromide or the like. Because of possible non-specific priming or readthrough of 

15 mutation sites, as well as the fact that most carriers ofmutant alleles will be 
heterozygous, the power of this technique may be low. 
(5) ElectroDhoretic MnhiHty 

Genetic testing based on DNA sequence differences also may be achieved by 
detection of alteraUons in electrophoretic mobility of DNA, mRNA or cDNA 
fragments in gels. Small sequence deletions and insertions, for example, can be 
visualized by high r^olution gel electrophoresis of single or double stranded DNA, or 
as changes in the migration pattern of DNA heteroduplexes in non-denaturing gel 
electrophoresis. Mutations or polymorphisms in the PS-interacting protein genes may 
also be detected by methods which exploit mobility shifts due to single-stranded 
25 conformational polymorphisms (SSGP) associated with mRNA or single-stranded 
DNA secondary structures. 

(6) Ghemical Cleavage nf Mi^matrh^c 

Mutations in the PS-interacting protein genes may also be detected by 
employing the chemical cleavage of mismatch (GCM) method (see. e.g., Saleeba and 
30 Gotton, 1 993, and references therein). In this technique, probes (up to -~ 1 kb) may be 
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mixed with a sample of genomic DNA, cDNA or mRNA obtained from a subject. 
The sample and probes are mixed and subjected to conditions which allow for 
heteroduplex formation (if any). Preferably, both the probe and sample nucleic acids 
are double-stranded, or the probe and sample may be PGR amplified together, to 
5 ensure creation of all possible mismatch heteroduplexes. Mismatched T residues are 
reactive to osmium tetroxide and mismatched C residues are reactive to 
hydroxylamine. Because each mismatched A will be accompanied by a mismatched 
T, and each mismatched G will be accompanied by a mismatched C, any nucleotide 
differences between the probe and sample (including small insertions or deletions) 
10 will lead to the formation of at least one reactive heteroduplex. After treatment with 
osmium tetroxide and/or hydroxylamine to modify any mismatch sites, the mixture is 
subjected to chemical cleavage at any modified mismatch sites by, for example, 
reaction with piperidine. The mixture may then be analyzed by standard techniques 
such as gel electrophoresis to detect cleavage products which would indicate 
15 mismatches between the probe and sample, 
(7) Other Methods 
Various other methods of detecting PS-interacting protein mutations, based 
upon the sequences disclosed and otherwise enabled herein, will be apparent to those 
of ordinary skill in the art. Any of these may be employed in accordance with the 
20 present invention. These include, but are not limited to, nuclease protection assays 
(SI or ligase-mediated), ligated PGR, denaturing gradient gel electrophoresis (DGGE; 
see, e.g., Fischer and Lerman, 1983), restriction endonuclease fingerprinting 
combined with SSCP (REF-SSCP; see, e.g., Liu and Sonrmier, 1995), and the like. 
D. Other Screens and Diagnostics 
25 In inherited cases, as the primary event, and in non-inherited cases as a 

secondary event due to the disease state, abnormal processing of the presenilins, PS- 
interacting proteins, APP, or proteins reacting with the presenilins, PS-interacting 
proteins, or APP may occur. This can be detected as abnormal phosphorylation, 
glycosylation, glycation amidation or proteolytic cleavage products in body tissues or 
30 fluids (e.g., CSF or blood). 
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Diagnosis also can be made by observation of alterations in transcription, 
translation, and post-translational modification and processing, as well as alterations 
in the intracellular and extracellular trafficking of gene products in the brain and 
peripheral cells. Such changes will include alterations in the amount of messenger 
5 RNA and/or protein, alteration in phosphorylation state, abnormal intracellular 
location/distribution, abnonmal extracellular distribution, etc. Such assays will 
include: Northern Blots (e.g., with PS-interacting protein-specific and non-specific 
nucleotide probes), Western blots and enzyme-linked immunosorijent assays (ELISA) 
(e.g., with antibodies raised specifically to a PS-interacting protein or PS-interacting 
10 functional domain, including various post-translational modification states including 
glycosylated and phosphorylated isoforms). These assays can be performed on 
peripheral tissues (e.g., blood cells, plasma, cultured or other fibroblast tissues, etc.) 
as well as on biopsies of CNS tissues obtained antemortem or postmortem, and upon 
cerebrospinal fiuid. Such assays might also include in situ hybridization and 
immunohistochemistiy (to localize messenger RNA and protein to specific subcellular 
comparmients and/or within neuropathological structures associated with these 

diseases such as neurofibrillary tangles and amyloid plaques). ' 
E. Screening and Diagnostic Kits 

In accordance with the present invention, diagnostic kits are also provided 
20 which will include the reagents necessary for the above-described diagnostic screens. 
For example, kits may be provided which include antibodies or sets of antibodies 
which are specific to one or more mutant epitopes. These antibodies may, in 

particular, be labeled by any of the standard means which facilitate visualization of 
binding. Alternatively, kits may be provided in which oligonucleotide probes or PGR 

25 primers, as described above, are present for the detection and/or amplification of 
normal or mutant presenilin and/or PS-interacting protein nucleotide sequences. 
Again, such probes may be labeled for easier detection of specific hybridization. As 
appropriate to the various diagnostic embodiments described above, the 
oligonucleotide probes or antibodies in such kits may be immobilized to substrates 

30 and appropriate controls may be provided. 
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11. Methods of Treatment 

The present invention now provides a basis for therapeutic intervention in 
diseases which are caused, or which may be caused, by mutations in the PS- 
5 interacting proteins. As noted above, mutations in the hPSl and hPS2 genes have 
been associated with the development of early onset forms of Alzheimer's Disease 
and, therefore, the present invention is particularly directed to the treatment of 
subjects diagnosed with, or at risk of developing, Alzheimer's Disease. 

Without being bound to any particular theory of the invention, the effect of 

10 the Alzheimer's Disease related mutations in the presenilins appears to be a gain of a 
noyel function, or an acceleration of a nonnal function, which directly or indirectly 
causes aberrant processing of the Amyloid Preciu^or Protein (APP) into Ap peptide, 
abnormal phosphorylation homeostasis, and/or abnormal apoptosis in the brain. Such 
a gain of function or acceleration of function model would be consistent with the adult 

15 onset of the symptoms and the dominant inheritance of Alzheimer's Disease. 

Nonetheless, the mechanism by which mutations in the presenilins may cause these 
effects remains unknown. 

The present invention, by identifying a set of PS-interacting proteins, 
provides new therapeutic targets for intervening in the etiology of presenilin-related 

20 AD. In addition, as mutations in the presenilins may cause AD, it is likely that 
mutations in the PS-interacting proteins may also cause AD. The fact that the PS- 
interacting protein S5a is alternately processed in the brains of victims of sporadic 
AD, as well as in the brains of victims of presenilin-linked AD, suggests that, at the 
very least, this PS-interacting protein is involved in the etiology of AD independent of 

25 mutations in the presenilins. It is likely that the other PS-interacting proteins also 
may be involved in non-presenilin-linked AD. 

Therapies to treat PS-interacting protein-associated diseases such as AD 
may be based upon (1) administration of nonnal PS-interacting proteins, (2) gene 
therapy with normal PS-interacting protein genes to compensate for or replace the 

30 mutant genes, (3) gene therapy based upon antisense sequences to mutant PS- 
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interacting protein genes or which "knock-out" the mutant genes. (4) gene therapy . 
based upon sequences which encode a protein which blocks or corrects the deleterious 
effects of PS-interacting protein mutants. (5) immunotherapy based upon antibodies 
nornial and/or mutant PS-interacting proteins, or (6) small molecules (drugs) which 
alter PS-interacting protein expression, alter interactions between PS-interacting 
proteins and other proteins or ligands, or which otherwise block the aberrant ftinction 
of mutant presenilin or PS-interacting proteins by altering the structure of the mutant 
proteins, by enhancing their metabolic clearance, or by inhibiting their fiinction. 
A. Protein Therapy 

Treatment of Alzheimer's Disease, or other disorders resulting from PS- 
interacting protein mutations, may be performed by replacing the mutant protein with 
normal protein, by modulating the function of the mutant protein, or by providing an 
excess of nonnal protein to reduce the effect of any aberrant function of the mutant 
proteins. 

To accomplish this, it is necessary to obtain, as described and enabled 
herein, large amounts of substantially pure PS-interacting protein from cultured cell 
systems which can express the protein. Delivery of the protein to the affected brain 
areas or other tissues can then be accomplished using appropriate packaging or 
administration systems including, for example, liposome mediated protein delivery to 
20 the target cells. 

B. Gene Therapy 

In one series of embodiments, gene therapy may be employed in which 
normal copies of a PS-interacting protein gene are introduced into patients to code 
successfully for normal protein in one or more different affected cell types. The gene 
must be delivered to those cells in a form in which it can be taken up and code for 
sufficient protein to provide effective function. Thus, it is preferred that the 
recombinant gene be operably joined to a strong promoter so as to provide a high 
level of expression which will compensate for, or out-compete, the mutant proteins. 
As noted above, the recombinant construct may contain endogenous or exogenous 
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regulatory elements, inducible or repressible regulatory elements, or tissue-specific 
regulatory elements. 

In another series of embodiments, gene therapy may be employed to 
replace the mutant gene by homologous recombination with a recombinant construct. 
5 The recombinant construct may contain a normal copy of the targeted PS-interacting 
protein gene, in which case the defect is corrected in situ, or may contain a "knock- 
out" construct which introduces a stop codon, missense mutation, or deletion which 
abolished function of the mutant gene. It should be noted in this respect that such a 
construct may knock-out both the normal and mutant copies of the targeted gene in a 

10 heterozygous individual, but the total loss of gene function may be less deleterious to 
the individual than continued progression of the disease state. 

In ariother series of embodiments, antisense gene therapy may be 
employed. The antisense therapy is based on the fact that sequence-specific 
suppression of gene expression can be achieved by intracellular hybridization between 

15 mRN A or DNA and a complementary antisense species. The formation of a hybrid 
duplex may then interfere with the transcription of the gene and/or the processing, 
transport, translation and/or stability of the target mRNA. Antisense strategies may 
use a variety of approaches including the administration of antisense oligonucleotides 
or antisense oligonucleotide analogs (e.g., analogs with phosphorothioate backbones) 

20 or transfection with antisense RNA expression vectors. Again, such vectors may 
include exogenous or endogenous regulatory regions, inducible or repressible 
regulatory elements, or tissue-specific riegulatory elements. 

In another series of embodiments, gene therapy may be used to introduce a 
recombinant construct encoding a protein or peptide which blocks or otherwise 

25 corrects the aberrant function caused by a mutant presenilin or PS-interacting protein 
gene. In one embodiment, the recombinant gene may encode a peptide which 
corresponds to a domain of a PS-interacting which has been found to abnormally 
interact with another cell protein or other cell ligand (e.g., a mutant prcsenilin). Thus, 
for example, if a mutant PSl TM6->7 domain is found to interact with a PS- 

30 interacting protein but the corresponding normal TM6->7 domain does not undergo 
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this interaction, gene therapy may be employed to provide an excess of the mutant 
TM6-H.7 domain which may compete with the mutant presenilin protein and inhibit or 
block the aberrant interaction. Alternatively, the PS-interacting domain of a PS- 
interacting protein which interacts with a mutant, but not a normal, presenilin may be 
encoded and expressed by a recombinant construct in order to compete with, and 
thereby inhibit or block, the aberrant interaction. 

Retroviral vectors can be used for somatic cell gene therapy especially 
because of their high efficiency of infection and stable integration and expression. A 
fiill length PS-interacting protein gene, subsequences encoding functional domains of 
these proteins, or any of the other therapeutic peptides described above, can be cloned 
into a retroviral vector and expression may be driven from its endogenous promoter, 
from the retroviral long terminal repeat, or from a promoter specific for the target cell 
type of interest (e.g., neurons). Other viral vectors which can be used include adeno- 
associated virus, vaccinia vims, bovine papilloma virus, or a herpes virus such as 
15 Epstein-Barr virus. 

C. Immunotherapy 

immunotherapy is also possible for Alzheimer's Disease. Antibodies may 
be raised to a normal or mutant PS -interacting protein (or a portion thereoO and are 
administered to the patient to bind or block an aberrant interaction (e.g., with a mutant 
presenilin) and prevent its deleterious effects. Simultaneously, expression of the 
normal protein product could be aicouraged. Alternatively, antibodies may be raised 
to specific complexes between mutant or wild-type PS-interacting proteins and their 
interaction partners. 

A fiirther approach is to stimulate endogenous antibody production to the 
desired antigen. Administration could be in the form of a one time immunogenic 
preparation or vaccine immunization. The PS-interacting protein or other antigen may . 
be mixed with pharmaceutically acceptable carriers or excipients compatible with the 
protein. The immunogenic composition and vaccine may fiirther contain auxiliary 
substances such as emulsifying agents or adjuvants to enhance effectiveness. 
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Immunogenic compositions and vaccines may be administered parenterally by 
injection subcutaneously or intramuscularly. 

D. Small Molecule Therapeutics 

As described and enabled herein, the present invention provides for a 
5 number of methods of identifying small molecules or other compounds which may be 
useful in the treatment of Alzheimer's Disease or other disorders caused by mutations 
in the presenilins or PS-interacting proteins. Thus, for example, the present invention 
provides for rnethods of identifying proteins which bind to normal or mutant PS- 
interacting proteins (aside from the presenilins). The invention also provides for 
10 methods of identifying small molecules which can be used to disrupt aberrant 

interactions between mutant presenilins and/or PS-interacting proteins and such other 
binding proteins or other cell components. 

Examples 

15 Example 1 . Isolation of PS-interacting proteins bv two-hybrid yeast system. 

To identify proteins interacting with the presenilin proteins, a 
commercially available yeast two-hybrid kit ("Matchmaker System 2" from Clontech, 
Palo Alto, CA) was employed to screen a brain cDNA library for clones which 
interact with functional domains of the presenilins. In view of the likelihood that the 

20 TM6-^7 loop domains of the presenilins are important functional domains, partial 
cDNA sequences encoding either residues 266-409 of the normal PSl protein or 
residues i272-390 of the normal PS2 protein were ligated in-frame into the EcoRI and 
BamHI sites of the pAS2-l fusion-protein expression vector (Clontech). The resultant 
fusion proteins contain the GAL4 DNA binding domain coupled in-frame either to the 

25 TM6-j^7 loop of the PSl protein or to the TM6-^7 loop of the PS2 protein. These 
expression plasmids were co-transformed into S. cercvisiae strain Y190 together with 
a library of human brain cDNAs ligated into the pACT2 yeast fiision-protein 
expression vector (Clontech) bearing the GAM activation domain using modified 
lithium acetate protocols of the "Matchmaker System 2" yeast two-hybrid kit 

30 (Clontech, Palo Alto, CA). Yeast clones bearing human brain cDNAs which interact 
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with the TM6-.7 loop domain were selected for His- resistance by plating on SD 
minimal medium lacking histidine and for Pgal+ activation by color selection. The 
His+ Pgal+ clones were then purged of the pAS2-l "bait" construct by culture in 
10^g/ml cyclohexamide and the unknown "trapped" inserts of the human brain 
cDNAs encoding PS-interacting proteins were isolated by PCR and sequenced. Of 6 
million initial transfoimants. 200 positive clones were obtained after His- selection, 
and 42 after pgal+ color selection, carried out in accordance with the manufacturer's 
protocol for selection of positive colonies. Of these 42 clones there were several 
independent clones representing the same genes. 

To address the likelihood that mutations in the presenilins cause AD 
through the acquisition of a novel but toxic function (i.e., dominant gain of function 
mutation) which is mediated by a novel interaction between the mutant proteins and 
one or more other cellular proteins, the human brain cDNA library cloned into the 
pACT2 expression vector (Clontech) was re-screened using mutant TM6^1 loop 
domain sequences as described above and acconiing to manufacturer's protocols. In 
particular, mutant presenilin sequences coiresponding to residues 260-409 of PS I 
TM6->7 loop domains bearing mutations L286V, L392V and A290-3 19 were ligated 
in-frame into the GAL4 DNA-binding domain of the pAS2-l vector (Clontech) and 
used to screen the human brain cDNA:GAL4 activation domain library of pACT 
vectors (Clontech). Yeast were co-transformed, positive colonies were selected, and 
"trapped" sequences were recovered and sequenced as described above. In addition to 
some of the same sequences recovered with the normal TM6-j>7 loop domains, 
several new sequences were obtained which reflect aberrant interactions of the mutant 
presenilins with normal cellular proteins. 

The recovered and sequenced clones corresponding to these PS-interacting 
proteins were compared to the pubUc sequence databases using the BL. ASTN 
algorithm via die NCBI e-mail server. Descriptions of several of these clones follow: 

Antisecretorv Factor/ Pmte^^r^. g..K..„;. overiapping clones 
(Y2H29 and Y2H3I) were identified which correspond to a C-terminal fiagment of a 
protein alternatively identified as Antisecretory Factor ("ASF") or the Multiubiquitin 
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chain binding S5a subunit of the 26S proteasome ("S5a") (Johansson el al. 1995; 
Ferrell et ah, 1996). The complete nucleotide and amino acid sequences of the S5a 
subunit are available through the public databases under Accession number U51007 
and are reproduced here as SEQ ID NO: 1 and SEQ ID NO: 2. The nucleotide 
5 sequences of the Y2H29 and Y2H31 clones include nucleotides 351-1330 of SEQ ID 
NO: 1 and amino acid residues 70-377 of SEQ ID NO: 2. Thus, residues 70-377 of 
the full S5a subunit include the PS-interacting domain of this protein. Residues 206- 
377 of S5a contain certain motifs that are important for protein-protein interactions 
(Ferrell etaL, 1996), 

10 The PS 1 -S5a subunit interaction was directly re-tested for both wild type 

and mutant PSl TM6->7 loop (residues 260-409) by transforming Y187 yeast cells 
with the appropriate wild type or mutant (L286V, L392V or A290-3 1 9) cDNA ligated 
in-frame to the GAL4-DNA binding domain of pACT2. The A290-3 19 mutant fusion 
construct displayed autonomous pgal activation in the absence of any S5a "target 

15 sequence" and, therefore, could not be further analyzed. In contrast, both the L286V 
and L392V mutant constructs interacted specifically with the S5a construct. 
Quantitative assays, however, showed that these interactions were weaker than those 
involving the wild type PSl^stMOQ sequence and that the degree of interaction was 
crudely correlated with the age of onset of FAD. The difference in pgal activation 

20 was not attributable to instability of the mutant PS Ij^^^^ construct mRNAs or fusion 
proteins because Western blots of lysates of transformed yeast showed equivalent 
quantities of mutant or wild-type fusion proteins. 

Because one of the putative functions of S5a is to bind multi-ubiquitinated 
proteins, the PS 1 :S5a interaction observed in S. cerevisiae could arise either through 

25 yeast-dependent ubiquitination of the PSl^^o, construct, or by direct interaction. The 
former would reflect a degradative pathway, a functional and perhaps reciprocal 
interaction between PSl and SSa, or both. A direct interaction is favored by the fact 
that the PSl:S5a interaction is decreased rather than increased by the presence of the 
L286V and L392V mutations, and by the fact that neither of these mutations affect 

30 ubiquilin conjugation sites in the PS12«mo9 loop (i.e., K265. K31 1, K314 or K395). 
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To fiirther examine this possibility, we investigated the direct interaction of 
recombinant His-tagged fusion proteins corresponding to full length S5a and the 
PSl2«^o9 loop. Partially purified recombinant His-tagged PSU^„ loop and His- 
tagged S5a proteins and appropriate controls were mixed in phosphate buffered saline. 
The mixture was then subjected to size exclusion chromatography, and eluates were 
examined by SDS-PAGE and Western blotting using anti-His-tag monoclonal 
antibodies (Quiagen). In the crude PS1,«^„ loop preparation alone, the PS\,^ loop 
eluted from the size exclusion column as a broad peak at 35 minutes. In the crude S5a 
preparation alone. S5a eluted at 25 minutes. However, when the cnide PS1,«^ loop 
and S5a preparations were mixed, there was a significant shift in the elution of PSU^o. 
,0, toward a higher molecular weight complex. Co-elution of S5a and PSl^^^ in the 
same fixation was confimed by SDS-PAGE and Western blotting effractions using 
the anti-His-tag antibody. These results are consistent with a ubiquitin-independent 
and, therefore, possibly functional interaction. 

GT24 and related genes with homolopv t n pl2Q/r>lakop l obin family . Five 
over-lapping clones (Y2H6. Y2H10b, Y2HI7h2, Y2H24, and Y2H25) were obtained 
which interact with the normal PS 1 TM6->7 loop domain and which appear to 
represent at least one novel gene! The Y2H24 clone was also found to interact with 
the mutant PSl TM6-*7 loop domains. Note that it appears that more than one 
member of the gene family was isolated, suggesting a family of genes interacting 
differentially with different presenilins. The most complete available cDNA 
corresponding to these clones was designated GT24 and is disclosed herein as SEQ ID 
NO: 3 and has been deposited with GenBank as Accession number U81004. The 
open reading fiame suggests that GT24 is a protein of at least 1040 amino acids with a 
unique N-terminus, and considerable homology to several armadillo (ann) repeat 
proteins at its C-terminus. The predicted amino acid sequence of GT24 is disclosed 
herein as SEQ ID NO: 4. Thus, for example, residues 440-862 of GT24 have 32-56% 
identity (p=1.2e-'.") to residues 440-854 of murine p 120 protein (Accession number 
Z17804), and residues 367-815 of GT24 have 26-42% identity (p=0.001 7) to residues 
245-465 of the D. melano^aster armadillo segment polarity protein (Accession 
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nuinberP18824). The GT24 gene maps to chromosome 5p 15 near the anonymous 
microsatellite marker D5S748 and the Cri-du-Chat syndrome locus. This sequence is 
also nearly identical to portions of two human ESTs of unknown function (i.e., 
nucleotides 2701-3018 of Accession number F08730 and nucleotides 2974-3348 of 
5 Accession number Tl 8858). These clones also show lower degrees of homology with 
other partial cDNA and gDNA sequences (e.g., HI 7245, T06654, T772 14, H24294. 
M62015, T87427 and G04019). 

p0071 sene. An additional His, Pgal* clone isolated in the initial screening 
with wild type PS 1 "bait" had a similar nucleotide sequence to GT24 (target 
10 clone Y2H25; Accession number U8 1 005), and would also be predicted to encode a 
peptide with C-temiinal ami repeats. A longer cDNA sequence closely corresponding 
to the Y2H25 clone has been deposited in GenBank as human protein p0071 
(Accession number X81889). The nucleotide and corresponding amino acid 
sequences of pQ071 are reproduced herein as SEQ ID NOs: 5 and 6. Comparison of 
15 the predicted sequence of the p007 1 ORF with that of GT24 confirms that they are 
related proteins with 47% overall amino acid sequence identity, and with 70% identity 
between residues 346-862 of GT24, and residues 509- 1 022 of p007 1 (which includes 
^ residues encoded by the Y2H25 cDNA). The latter result strongly suggests that PSl 
interacts with a novel class of arm repeat containing proteins. The broad ~ 4 kb 
20 hybridization signal obtained on Northern blots with the unique 5 ' end of GT24 could 
reflect either alternate splicing/polyadenylation of GT24, or the existence of 
additional members of this family with higher degrees of N-tenninal homology to 
GT24thanp0071. 

Rabll gene. This clone (Y2H9), disclosed herein as SEQ ID NO: 7, was 
25 identified as interacting with the normal PSl TM6-»-7 loop domain and appears to 
correspond to a known gene, Rabl 1, available through Accession numbers X56740 
and X53I43. Rabl 1 is believed to be involved in protein/vesicle trafficking in the 
ER/Golgi. Note the possible relationship to processing of membrane proteins such as 
BAPP and Notch with resultant overproduction of toxic AB peptides (especially 
30 neurotoxic Afl,^«, isofonms) (Scheuner, et al, 1 995). 
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Retinoid X receptor-B gene. This clone (Y2H23b), disclosed herein as 
SEQ ID NO: 8, was identified as interacting with the normal PSl TM6-*7 loop 
domain and appears to correspond to a known gene, known variously as the retinoid X 
receptor-p, nuclear receptor co-regulator or MHC Class I regulatory element, and 
5 available through Accession numbers M84820,X63522 and M8 1766. This gene is 
believed to be involved in intercellular signaling, suggesting a possible relationship to 
the intercellular signaling function mediated by C. elegans sell 2 and Notch/lin-12 
(transcription activator). 

Cytoplasmic chaoeronin eene. This clone (Y2H27), disclosed herein as 
10 SEQ ID NO: 9, was identified as interacting with the normal PS 1 TM6->>7 loop 
domain and appears to correspond to a known gene, a cytoplasmic chaperonin 
containing TCP- 1 , available through Accession numbers Ul 7 1 04 and X74801 '. 

Unknown gene rY?Hl^) This clone (Y2H35), disclosed herein as SEQ 
ID NO: 10, was idenUfied as interacting with the normal PSI TM6->7 loop domain 
15 and appears to correspond to a known gene of unknown function, available through 
Accession number R12984, which shows conservation down through yeast. 

Unknown gene (Y2H 171). This clone (Y2H 171), disclosed herein as SEQ 
ID NO: 1 1 , was identified as interacting with the normal PS 1 TM6->7 loop domain 
and appears to correspond to a known expressed repeat sequence available through 
20 Accession number D55326. 

Unknown gene fY2H41). This clone (Y2H41) was identified which reacts 
strongly with the TM6-^7 loop domains of both PSl and PS2 as well as the mutant 
loop domains of PSl. The sequence, disclosed as SEQ ID NO: 12, shows strong 
homology to an EST of unknown function (Accession number T64843). 
Example 2. Isolation of presenili n binding proteins bv affinity chromato frraphy 

To identify the proteins which may be involved in the biochemical 
function of the presenilins. PS-interacting proteins were isolated using affinity 
chromatography. A GST-fiision protein containing the PSl TM6->7 loop, prepared 
as described in Example 3, was used to probe human brain extracts, prepared by 
30 homogenizing brain tissue by Polytron in physiological salt solution. Non-specific 
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binding was eliminated by pre-clearing the brain homogenates of endogenous GST- 
binding components by incubation with glutathione-Sepharose beads. These GST- 
free homogenates were then incubated with the GST-PS fusion proteins to produce the 
desired complexes with functional binding proteins. These complexes were then 
5 recovered using the affinity glutathione-Sepharose beads. After extensive washing 
with phosphate buffered saline, the isolated collection of proteins was separated by 
SDS-polyacrylamide gel electrophoresis (SDS-PAGE; Tris-tricine gradient gel 4- 
20%). Two major bands were observed at -14 and 20 kJD in addition to several 
weaker bands ranging firom 50 to 60 kD. 
10 The same approach may now be used to identify proteins which have 

binding activity for the PS-interacting proteins and, thereby, to further elucidate the 
etiology of AD and to identify additional therapeutics targets for intervention in AD 
and related disorders. 

Example 3. Eukarvotic and prokarvotic expression vector systems. 

15 Constructs suitable for use in eukaiyotic and prokaryotic expression 

systems have been generated using different classes of PSl nucleotide cDNA 
sequence inserts. In the first class, termed full-length constructs, the entire PSl cDNA 
sequence is inserted into the expression plasmid in the correct orientation, and 
includes both the natural 5' UTR and 3' UTR sequences as well as the entire open 

20 reading frame. The open reading frames bear a nucleotide sequence cassette which 
allows either the wild type open reading frame to be included in the expression system 
or alternatively, single or a combination of double mutations can be inserted into the 
open reading frame. This was accomplished by removing a restriction fragment from 
the wild type open reading frame using the enzymes Narl and Pflml and replacing it 

25 with a similar fragment generated by reverse transcriptase PGR and bearing the 

nucleotide sequence encoding either the M146L mutation or the H163R mutation. A 
second restriction fragment was removed from the wild type normal nucleotide 
sequence for the open reading frame by cleavage with the enzymes Pflml and Ncol 
and replaced with a restriction fragment bearing the nucleotide sequence encoding the 

30 A246E mutatioti, the A260V mutation, the A285V mutation, the L286V mutation, the 
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L392V mutation or the C410Y mutation. A third variant, bearing a combination of 
either the M146L or H163R mutation in tandem with one of the remaining mutations, 
was made by linking a Narl-PflmI fragment bearing one of the former mutations and . 
Pflml-Ncol fragment bearing one of the latter mutations. 

The second class of cDNA inserts, termed truncated constructs, was 
constructed by removing the 5' UTR and part of the 3' UTR sequences from full 
length wild type or mutant cDNA sequences. The 5' UTR sequence was replaced with 
a synthetic oligonucleotide containing a Kpnl restriction site (GGTAC/C) and a small 
sequence (GCCACC) to create a Kozak initiation site around the ATG at the 
begimiingofthePSl ORF. The 3' UTR was replaced with an oligonucleotide with an 
artificial EcoRI site at the 5' end. Mutant variants of this construct were then made by 
inserting the mutant sequences described above at the Narl-Pflml and Pslml-Ncol 
sites as described above. 

For eukaryotic expression, these various cDNA constructs bearing wild 
type and mutant sequences, as described above, were cloned into the expression 
vector pZeoSV in which the SV60 promoter cassette had been removed by restriction 
digestion and replaced with the CMV promoter element of pcDNA3 (Invitrogen). For 
priokaryotic expression, constnicts have been made using the glutathione S-transferase 
(GST) fusion vector pGEX-kg. The inserts which have been attached to the GST 
fusion nucleotide sequence are the same nucleotide sequences described above 
bearing either the normal open reading frame nucleotide sequence, or bearing a 
combination of single and double mutations as described above. These GST fusion 
constructs allow expression of the partial or full-length protein in prokaryotic cell 
systems as mutant or wild type GST fusion proteins, thus allowing purification of the 
full-length protein followed by removal of the GST fusion product by thrombin 
digestion. A farther cDNA construct was made with the GST fiision vector, to allow 
the production of the amino acid sequence corresponding to the hydrophilic acidic 
loop domain between TM6 and TM7 of the full-length protein, either as a wild type 
nucleotide sequence or as a mutant sequence bearing either the A285V mutation, the 
L286V mutation or the L392V mutation. This was accomplished by recovering wild 
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type or mutant sequence from appropriate sources of RNA using a 5* oligonucleotide 
primer with a 5* BamHI restriction site (G/GATCC), and a 3* primer with a 5' EcoRI 
restriction site (G/AATTC). This allowed cloning of the appropriate mutant or wild 
type nucleotide sequence corresponding to the hydrophilic acidic loop domain at the 
5 BamHI and the EcoRl sites within the pGEX-KG vector. 

The PS-interacting protein genes may be similarly manipulated by 
recombinant means for expression in prokaryotic or eukaryotic hosts. In particular, 
GST or other fusion proteins may be produced which will be useful in assays (e.g., 
yeast two-hybrid studies) for therapeutics. 

10 Example 4. Antibody production. 

Peptide antigens corresponding to portions of the PSl protein were 
synthesized by solid-phase techniques and purified by reverse phase high pressure 
liquid chromatography. Peptides were covalently linked to keyhole limpet 
hemocyanin (KLH) via disulfide linkages that were made possible by the addition of a 

15 cysteine residue at the peptide C-terminus of the presenilin fragment. This additional 
residue does not appear normally in the protein sequence and was included only to 
facilitate linkage to the KLH molecule. 

A total of three New Zealand white rabbits were immunized with peptide- 
KLH complexes for each peptide antigen in combination with Freund's adjuvant and 

20 were subsequently given booster injections at seven day intervals. Antisera were 
collected for each peptide and pooled and IgG precipitated with ammonium sulfate. 
Antibodies were then affinity purified with Sulfo-link agarose (Pierce) coupled with 
the appropriate peptide. This final purification is required to remove nonrspecific 
interactions of other antibodies present in either the pre- or post-inmiune serum. 

25 The specificity of each antibody was confirmed by three tests. First, each 

detected single predominant bands of the approximate size predicted for presenilin- 1 
on Western blots of brain homogenate. Second, each cross-reacted with recombinant 
fusion proteins bearing the appropriate sequence. Third each could be specifically 
blocked by pre-absorption with recombinant PSl or the inununizing peptide. 
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Antibodies to peptides derived from the PS-interacting proteins may be 
produced by similar means. 
Example 5. Transgenic mice. 

A series of wild type and mutant PS 1 and PS2 genes were constructed for 
use in the preparation of transgenic mice. Mutant versions of PSl and PS2 were 
generated by site-directed mutagenesis of the cloned cDNAs using standard 
techniques. 

The cDNAs and their mutant versions were used to prepare two classes of 
mutant and wild type PSl and PS2 cDNAs, as described in Example 3. The first 
class, referred to as "full-length" cDNAs, were prepared by removing approximately 
200 bp of the 3* untranslated region immediately before the polyA site by digestion 
with EcoRI (PS 1 ) or PvuII (PS2). The second class, referred to as "truncated" 
cDNAs, were prepared by replacing the 5* untranslated region with a ribosome 
binding site (Kozak consensus sequence) placed immediately 5' of the ATG start 
15 codon. 

Various foil length and truncated wdld type and mutant PSl and PS2 
cDNAs, prepared as described above, were introduced into one or more of the 
following vectors and the resulting constructs were used as a source of gene for the 
production of transgenic mice. 

The cos.TET expressioii vector- This vector was derived fttim a cosmid 
clone containing the Syrian hamster PrP gene. It has been described in detail by Scott 
et al. (1992) and Hsiao et al. (1995). PSl and PS2 cDNAs (foil length or truncated) 
were inserted into this vector at its Sail site. The final constructs contain 20 kb of 5' 
sequence flanking the inserted cDNA. This 5' flanking sequence includes the PrP 
25 gene promoter, 50 bp of a PrP gene 5' untranslated region exon, a splice donor site, a 1 
kb intron, and a splice acceptor site located immediately adjacent to the Sail site into 
which the PSl orPS2 cDNA was inserted. The 3' sequence flanking the inserted 
cDNA includes an approximately 8 kb segment of PrP 3' untranslated region 
including a polyadenylation signal. Digestion of this constract with Not! (PSl) or 
30 Fsel (PS2) released a fragment containing a mutant or wild type PS gene under the 
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control of the PrP promoter. The released fragment was gel purified and injected into 
the pronuclei of fertilized mouse eggs using the method of Hsiao et al. (1995). 

Platelet-derived growth factor receptor B-subunit constructs : PS cDNAs 
were also introduced between the Sail (full length PSl cDNAs) or Hindlll (truncated 
5 PSl cDNAs, fiill length PS2 cDNAs, and truncated PS2 cDNAs) at the 3' end of the 
human platelet derived grovAh factor receptor p-subunit promoter and the EcoRI site 
at the 5' end of the SV40 polyA sequence and the entire cassette was cloned into the 
pZeoSV vector (Invitrogen, San Diego, CA.). Fragments released by Scal/BamHI 
digestion were gel purified and injected into the pronuclei of fertilized mouse eggs 

10 using the method of Hsiao et al. (1995). 

Human B-ac tin constructs : PSl and PS2 cDNAs were inserted into the Sail 
site of pBAcGH. The construct produced by this insertion includes 3.4 kb of the 
human p actin 5' flanking sequence (the human p actin promoter, a spliced 78 bp 
human p actin 5' untranslated exon and intron) and the PSl or PS2 insert followed by 

15 2.2 kb of human growth hormone genomic sequence containing several introns and 
exons as well as a polyadenylation signal. Sfil was used to release a PS-containing 
fragment which was gel purified and injected into the pronuclei of fertilized mouse 
eggs using the method of Hsiao et al. ( 1 995). 

Phosphoglvcerate kinase constructs : PSl and PS2 cDNAs were introduced 

20 into the pkJ90 vector. The cDNAs were inserted between the Kpnl site downstream 
of the human phosphoglycerate kinase promoter and the Xbal site upstream of the 3* 
untranslated region of the human phosphoglycerate kinase gene. PvuII/Hindlll (PSl 
cDNAs) or PvuII (PS2 cDNAs) digestion was used to release a PS-containing 
fragment which was then gel purified and injected into the pronuclei of fertilized 

25 mouse eggs as described above. 

Analysis of AB in transgenic murine hippocampus : To analyze the effect 
of a mutant human PSl transgene in mice, a PSl mutation observed in conjunction 
with a particularly severe form of early-onset PS 1 -linked Alzheimer's disease was 
used, namely the M146L missense mutation (Sherrington et al., 1995). The animals. 

30 which were heterozygous for the PS 1 mutant transgene on a mixed FVB-C57BL/6 
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strain background, were cross-bred with similar mice bearing the human wild-type 
PAPP,„ cDNA under the same Syrian hamster PrP promoter similar to those animals 
recently described by Hsiao et al.. 1995. These cross breedings were done because it 
is thought that human Ap is more susceptible to the formation of aggregates than are 
murine AP peptides. 

The progeny of these PS x PAPP^ cross-bfeedings were then 
genotyped to identify animals that contained both the human wild-type pAPP«, 
transgene and also the mutant human PSl^u.^ transgene. These mice were aged until 
two to three months of age and then sacrificed, with the hippocampus and neocortex 
being dissected rapidly from the brain and frozen. Litter mates of these mice, which 
contained only the wild-type human PAPP,„ transgene were also sacrificed, and their 
hippocampi and neocortices were dissected and rapidly frozen as well: 

The concentration of both total AP peptides (Ap,^„ and AP^.,,,,,,) as well 
as the subset of AP peptides ending on residues 42 or 43 (long-tailed Ap„ peptides) 
were then measured using a two-sandwich ELISA as described previously (Tamaoka 
et al., 1994; Suzuki et al.. 1994). These results convincingly showed a small increase 
in total AP peptides in the double transgenic animals bearing Wild-type human 
PAPP„5 and mutant human PS1mm6l transgenes compared to the wild-type human 
PAPP^, controls. More impressively, these measurements also showed that there was 
an increase in the amount of long-tailed Ap peptides ending on residues 42 or 43 
(Ap„). In contrast, litter mates bearing only the wild-type himian pAPP^^ transgene 
had Ap« long-tailed peptide values which were below the limit of quantitation 
("BLQ"). 

These observations therefore confirm that the construction of transgenic 
animals can recapitulate some of the biochemical features of human Alzheimer's 
disease (namely the overproduction of Ap peptide and, in particular, overproduction 
of long-tailed isoforms of Ap peptide). These observations thus prove that the 
transgenic models are in fact useful in exploring therapeutic targets relevant to the 
treatment and prevention of Alzheimer's disease. 
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Analysis of hi ppocampus dependent memory functions in PSl transgenic 
niice: Fourteen transgenic C57BL/6 x FVB mice bearing the human PS1mm6v mutant 
transgene under the PrP prompter (as described) above and 12 wild type litter mates 
aged 2.5-3 months of age (both groups were balanced for age. weight, and sex) were 
5 investigated for behavioral differences attributable to the mutant transgene. Also the 
qualitative observation of murine behavior in their home cages did not indicate 
bimodal distribution of behaviors in the sample of animals. 

Experiment I . To test for subtle differences in exploratory 
behavior (e.g. locomotion, scaiming of the environment through rearing, and patterns 

10 of investigation of unfamiliar environment), both PS 1 mm6v and wild type litter mates 
were tested in the open-field (Janus, et al. 1995). The results of the test revealed no 
significant differences between transgenics and controls in exploration of a new 
environment measured by mice locomotor behaviors (walking, pausing, wall leaning, 
rearing, grooming), (F(l,24) = .98, NS). Thus, differences any in behavior on the 

15 Morris water maze test (see below) cannot be attributed to differences in locomotor 
abilities, etc. 

Experiment 2 . One week afier the open-field test, the PS1mj46v 
mutant transgenic mice and their litter mates were trained in the Morris water maze. 
In this test, a mouse has to swim in a pool in order to find a submerged escape 

20 platform. The animal solves that test through learning the location of the platform 
using the available extra-maze spatial cues (Morris, 1990). This test was chosen 
because there is strong evidence that the hippocampal formation is involved in this 
form of learning. The hippocampus is also a major site of AD neuropathology in 
humans and defects in spatial learning (geographic disorientation, losing objects, 

25 wandering, etc.) are prominent early features of human AD. As a result the test is 
likely to detect early changes equivalent to those seen in human AD. The Morris test 
is conducted in three phases. In the first phase (the learning acquisition phase), the 
mouse has to learn the spatial position of the platform. In the second phase (the probe 
trial), the platform is removed from the pool and the mouse's search for the platfonm 

30 is recorded. In the final phase (the learning transfer phase), the platform is replaced in 
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a new position in the pool, and the mouse has to leam that new spatial position of the 
platform. 

Transgenic and wild type mice did not differ in their latencies to find the 
platform during learning acquisition (F(1.24) = 0.81, NS). and both groups showed 
rapid learning across trials (F(10.15) = 11.57. p < 0.001). During the probe trial 
phase, mice fi-om both groups searched the quadrant of the pool which originally 
contained the platform significantly longer than other areas of the pool which had not 
contained the platform (F(3,22> = 28.9, p < 0.001). However, the wild type controls 
showed a trend which was not quite statistically significant (t(24) = 1.21, p = 0.24) for 
an increased number of crossings of the exact previous position of the platform. In 
the learning transfer test, both groups showed the same latency of finding the new 
position of the platform in the initial block of trials (t(24) = 1.11, NS). Such long 
latency to find the new spatial position is expected because the mice spent most of 
their time searching for the platform in the old spatial position. However, in later 
15 trials in the learning transfer phase, the wild type mice showed shorter swim latencies 
to the new position of the platforin compared to the PS1„„,, mutant transgenics 

(F(1.24)2.36,p = 0.14). TheresultsindicatethatPSlMM^vmutanttransgenicmice ; 
were less flexible in transferring learned information to a new sittiation and tended to 
persevere in their search for the platform in the old location. 
20 Thus, although no differences were found in the spontaneous exploration 

of a new environment and in the acquisition of new spatial information between the 
wild type and the PS1„.^, mutant transgenic mice, the PSl^,,, mutant transgenic 
mice were impaired in switching and/or adapting this knowledge in later simations. 

Electrophysiolopical Recordings in the hinnoc^mp nc mutant tn.n.a.ni. 
25 mice: Five to six months old litter mate control and human PS1m„,v mutant 

transgenic mice on the same C57BU6 x FVB strain backgrounds as above were used 
to study long temi potentiation (LTP) as an electrophysiologic correlate of learning 
and memory in the hippocampus. Recordings were carried out on 400 tim thick 
hippocampal slices according to conventional techniques. Briefly, brains were 
30 removed and transverse sections containing hippocampi were obtained within 1 min. 
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after mice were decapitated under halothane anesthesia. Slices were kept at room 
temperature in oxygenated artificial cerebrospinal fluid for one hour prior to 
recording. One slice at a time was transferred to the recording chamber, where they 
were maintained at 32 ''C in an interface between oxygenated artificial cerebrospinal 
5 fluid and humidified air. Slices were then allowed to equilibrate in the recording 
chamber for another hour. 

Extracellular field recordings were carried out in the CAl subfield of the 
hippocampus at the Schaeffer collateral-pyramidal cell synapse. Synaptic responses 
were induced by the stimulation of Schaeffer collaterals at a frequency of 0.03 Hz and 

10 an intensity of 30-50 % of maximal response. Tetani to evoke long-term potentiation 
consisted of 5 trains of 100 Hz stimulation lasting for 200 ms at an intertrain interval 
of 10 seconds. Field potentials were recorded using an Axopatch 200B amplifier 
(Axon Instrument). Glass pipettes were fabricated from borosilicate glass with an 
outer diameter of 1 .5 mm, and pulled with a two step Narishige puller. Data were 

15 acquired on a 486-IBM compatible computer using PCLAMP6 software (Axon 
Instrimient). 

To test for any abnormality in presynaptic function, we investigated the 
differences in paired-pulse faciUtation, which is an example of use-dependent increase 
in synaptic efficacy and is considered to be presynaptic in origin. In hippocampus, 

20 when two stimuli are delivered to the Schaeffer collaterals in rapid succession, paired- 
pulse facilitation manifests itself as an enhanced dendritic response to the second 
stimulus as the interstimulus interval gets shorter. In three pairs of wild- 
type/transgenic mice, we did not observe any difference in the paired-pulse facilitation 
over an interstimulus interval range of 20 ms to 1 sec. These data suggest that in 

25 PS 1m, 46V mutant transgenic mice, the excitability of Schaeffer collateral fibers and 
neurotransmitter release are likely to be normal. 

Tetanic stimulation induced a long-lasting increase in the synaptic strength 
in both control (n = 3) and PSl^j^^v mutant transgenic mice (n = 2). In slices obtained 
from the PSl^usv mutant transgenic mice, long-lasting increase in the synaptic 

30 strength was 30.% more than that obtained from control mice. 
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Although preferred embodiments of the invention have been described 
herein in detail, it will be understood by those skilled in the art that variations may be 
made thereto without departing from the spirit of the invention or the scope of the 
5 appended claims. 
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(1> GENERAL INTORMATION; 



(i) APPLICANT; 



ST. GEORGE-HYSLOP, PETER H 
ROMMENS, JOHANNA M. 
FRASER, PAUL E. 



'^^^^f^SrriSn^^'^i*^"; NUCLEIC ACIDS AND PROTEINS RELATED TO 
ALZHEIMER'S DISEASE, AND USES THEREFOR 



NUMBER OF SEQUENCES 

(iv) 



12 



CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sim & McBurney 

(B) . STREET: 330 University Avenue. 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

( F) ZIP: MSG 1R7 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
(B» FILING DATE: 27-JAN-1997 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/592,541 

(B) FILING DATE: 26-JAN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,673 

(B) FILING DATE: OS-JUL-1996 



Floor 



Version #1.30 



( vii ) 



i vii ) 



(vii ) 



( viii ) 



PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,700 

(B) FILING DATE: 12-JUL-1996 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/029,895 

(B) FILING DATE: 08-NOV-1996 

PRIOR APPLICATION DATA; 

(A) APPLICATION NUMBER: DOCKET* CAN-006PR(d) 

(B) FILING DATE: 02-JAN-1997 

ATTORNEY/AGENT INFORMATION: 
(A) NAME: RAE, Patricia A. 



( ix ) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (416) 595-1155 

(B) TELEFAX: (416) 595-1163 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) • FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION; 145. .1275 

(P) OTHER INFORMATION: /product* 



'S5a* 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

AATTCCCAAA TGACCTTTTA TTTCATACAG AGATACAAAG GCAACTATGT GCAGCAACAA 60 

TCTGATGGGC AGTCCAAACT CTTGGGAGGA AGTAAATTCA TGGTAAATGT CATGATGGCG 120 

GTCGGGAGGG AGGAAGGTGG CAAG ATG GTG TTG GAA AGC ACT ATG GTG TGT 171 

Met Val Leu Glu Ser Thr Met Val Cys 

1 . 5 

GTG GAC AAC AGT GAG TAT ATG CGG AAT GGA GAC TTC TTA CCC ACC AGG 219 
val Asp Asn Ser Glu Tyr Met Arg Asn Gly Asp Phe Leu Pro Thr Ara 
10 15 20 25 

CTG CAG GCC CAG CAG GAT GCT. GTC AAC ATA GTT TGT CAT TCA AAG ACC 2 67 

Leu Gin Ala Gin Gin Asp Ala Val Asn lie Val Cys His Ser Lvs Thr 
30 35 40 
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CGC AGC AAC CCT GAG AAC AAC GTG GGC CTT ATC 
Arg Ser Asn Pro Giu Asn Asn Val Giy Leu lie 
4S 50 

TGT GAA GTG CTG ACC ACA CTC ACC CCA GAC ACT 
Cys Glu Val Leu. Thr Thr Leu Thr Pro Asp ?" 

AAG CTA CAT ACT GTC CAA CCC AAG GGC AAG ATC 
Lys Leu Hxs Thr Val Gin Pro Lys Gly tys lie 
'3 80 

ATC CGC GTG GCC CAT CTG GCT CTG AAG CAC CGA 
lie Arg Val Ala His Leu Ala Leu tys His Ira 

95 log 

AAG ATG CGC ATC ATT GCC TTT GTG GGA AGC CCA 
Lys Met Arg He He Ala Phe Val Gly Ser Pro 
110 

AAG GAT CTG GTG AAA CTG GCT AAA CGC CTC AAG 
Lys Asp Leu Val Lys Leu Ala Lys Aro Leu Lys 
125 i3g 

vIT A^^ tT*^ ^^'^ "^"^^ GAG GAG GTG 

Val Asp lie He Asn Phe Giy Glu Giu Glu Val 
140 145 

ACA GCC TTT GTA AAC ACG TTG AAT GGC AAA GAT 
Thr Ala Phe Val Asn Thr Leu Asn cly tys Asp 

?J9 CC"^ CCT GGG CCC AGT TTG GCT 

Leu val Thr Vai Pro Pro Giy Pro Ser ilu All 

lao 

ler lU ?r GGT GCC ATG 

Ser Pro He Leu Ala Gly Glu Gly Gly Ala Met 
190 195 



ACT GAC TTT GAA TTT GGA GTA GAT CCC AGT GCT 
Ser Asp Phe Glu Phe Gly Val Asp Pro Ser 111 
205 210 

TTG GCC CTT CGT GTA TCT ATG GAA GAG CAT err 
Leu Ala Leu Arg Val Ser Me? Glu G?S Arg 

ctn af^ GCA GCT TCT GCT GCT 

Glu Ala Arg Arg Ala Ala Ala Ala Ser Ala Ala 

240 

Thr T^r Sk"^ '^CA GAC GAT GCC CTG 

Thr Thr Gly Thr Glu Asp Ser Asp Asp Ala Leu 

255 260 

Ser G^^n r^^ ^^C ACT GGG CTT CCT 

Ser Gin Gin Glu Phe Gly Arg Thr Gly. Leu Pro 
270 275 

SS*^ S^^ GAA GAG CAG ATT GCT TAT GCC ATC CAC 
Thr Glu Giu Glu Gin He Ala ?yr Ala Set ctn 
285 290 

ti'. s?s III if.ijsfj! i?5 u^iiitn ?n 

300 

M^t ASD Thr q^r r^^ ^AG GAG GAT 

"et Asp Thr Ser Giu Pro Ala Lys Glu Glu Asp 
JJ.3 320 

r^^ GTT CAG AGT GTC CTA GAG 

Gin Asp Pro Glu Phe Leu Gin Ser vl^ cfS 

-'^S 340 

GAT CCC AAC AAT GAA GCC ATT CGA AAT rrr arr- 
Asp Pro Asn Asn Glu Ala lie A?g Asn All Set 
-^50 355 



ACA CTG GCT AAT GAC 

Thr Leu Ala Asn Asp 
55 

GGC CGT ATC CTG TCC 

Gly Arg He Leu Ser 

ACC TTC TGC ACG GGC 
Thr Phe Cys Thr Giy 
8 5 

CAA GGC AAG AAT CAC 
Gin Gly Lys Asn His 
105 

GTG GAG. GAC AAT GAG 
Val Glu Asp Asn Giu 
120 

AAG GAG AAA GTA AAT 
Lys Giu Lys Val Asn 
135 

AAC ACA GAA AAG CTG 
Asn Thr Glu Lys Leu 

GGA ACC GGT TCT CAT 
Gl^ Thr Gly Ser His 

GAT GCT CTC ATC AGT 
Asp Ala Leu He Ser 
185 

CTG GGT CTT GGT GCC 
- GI3 
20? 



Leu Giy Leu Gly Ala 



Gin Ala Thr ttt A^n ^f"" f'^^ ^^^^ ^^G AAG 

^in Aia Thr Lys Asp Gly Lys Lys Asp Lys Lys 
-^oS 370 

TGAGACTGGA GGGAAAGGGT AGCTGAGTCT GCTTAGGGAC 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS* 

IrI iv2f'^"= ^^^"O Acids' 

(B) TYPE: ammo acid 

<D; TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 



GAT CCT GAG CTG GCC 

Asp Pro Glu Leu Ala 
215 

CAG CGG CAG GAG GAG 

Gin Arg Gin Glu Giu 

GAG GCC GGG ATT GCT 
Glu Aia Gly He Aia 
2 4 5 

CTG AAG ATG ACC ATC 
Leu Lys Met Thr He 
2 65 

GAC CTA AGC AGT ATG 
Asp Leu Ser Ser Met 
2 80 

ATG TCC CTG CAG GGA 
Met Ser Leu Gin Glv 
295 ' 

GAT GCC AGC TCA GCT 

Asp Ala Ser Ser Ala 
310 

GAT TAC GAC GTG ATG 

Asg Tyr Asp Val Met 

AAC CTC CCA GGT GTG 
Asn Leu Pro Gly Val 
345 

GGC TCC CTG GCC TCC 
Gly Ser Leu Ala Ser 
360 



GAG GAA GAC AAG AAG 
Glu Glu Asp Lys Lvs 
375 

TGCATGGGGG AATTC 



315 



363 



411 



459 



507 



555 



603 



651 



699 



747 



795 



843 



891 



939 



987 



1035 



1083 



1131 



1179 



1227 



1275 



1330 
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(xi) SEQUENCE DESCRIPTION: SEO ID NO : 2 : 

Met Val Leu Glu Ser Thr Met Val Cys Vai Asp Asn Ser Glu Tvr Mec 
I 5 10 . 15 

Arg Asn Gly Asp Phe Leu Pro Thr Acq Leu Gin Ala Gin Gin Asp Ala 
20 25 30 

Vai Asn He Val Cys His Ser Lys Thr Arg Ser Asn Pro Glu Asn Asn 
35 40 4 5 

Val Giv Leu He Thr Leu Ala Asn Asp Cys Glu Val Leu Thr Thr Leu 
50 55 60 

Thr Pro Asp Thr Gly Aro He Leu Ser Lys Leu His Thr Val Gin Pro 
65 70 75 80 

Lys Gly Lys He Thr Phe Cys Thr Gly He Arg Val Ala His Leu Ala 
85 . 90 95 

Leu Lys His Arg Gin Gly Lys Asn His Lys Met Arg He He Ala Phe 
100 105 110 

Vai Gly Ser Pro Val Glu Asp Asn Glu Lys Asp Leu Val Lys Leu Ala 
115 120 125 

Lys Arg Leu Lys Lys Glu Lys Val Asn Val Asp He He Asn Phe Gly 
130 135 140 

Glu Glu Glu Val Asn Thr Glu Lys Leu Thr Ala Phe Val Asn Thr Leu 
145 150 155 160 

Asn Gly Lys Asp Gly Thr Gly Ser His Leu Val Thr Val Pro Pro Glv 
165 170 175 

Pro Ser Leu Ala Asp Ala Leu He Ser Ser Pro He Leu Ala Gly Glu 
180 185 190 

Gly Gly Ala Met Leu Gly Leu Gly Ala Ser Asp Phe Glu Phe Gly Val 
195 200 205 

Asp Pro Ser Ala Asp Pro Glu Leu Ala Leu Ala Leu Arg Val Ser Met 
210 215 220 

Glu Glu Gin Arg Gin Arg Gin Glu Glu Glu Ala Arg Arg Ala Ala Ala 
225 230 235 240 

Ala Ser Ala Ala Glu Ala Gly He Ala Thr Thr Gly Thr Glu Asp Ser 
245 250 255 • 

Asp Asp Ala Leu Leu Lys Met Thr He Ser Gin Gin Glu Phe Glv Arq 
260 265 270 

Thr Gly Leu Pro Asp Leu Ser Ser Met Thr Glu Glu Glu Gin lie Ala 
275 280 285 

Tyr Ala Met Gin Met Ser Leu Gin Gly Ala Glu Phe Gly Gin Ala Glu 
290 295 300 

Ser Ala Asp He Asp Ala Ser . Ser Ala Met Asp Thr Ser Glu Pro Ala 
305 310 3lS 320 

Lys Glu Glu Asp Asp Tyr Asp Val Met Gin Asp Pro Glu Phe Leu Gin 
325 330- 335 

Ser. Val Leu Glu Asn Leu Pro Gly Val Asp Pro Asn Asn Glu Ala lie- 
340 345 ' 350 

Arg Asn Ala Met Gly Ser Leu Ala Ser Gin Ala Thr Lys Asp Gly Lys 
355 3 60 3 65 

Lys Asp Lys Lys Glu Glu Asp Lys Lys 
37b 375 

(2) INFORMATION FOR SEO ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3841 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .3121 

(ix) FEATURE: 

<A) NAME/KEY: misc feature 

( B) LOCATION : 1 . . 3F4 1 

(D) OTHER INFORMATION: /note= •'GT24' 



SUBSTITUTE SHEET (RULE 26) 



wo 9inil9(i 



PCT/CA97/00051 



, . -99- 

(Xil SEQUENCE DESCRIPTION: SEQ ID N0:3: 

' ii\ i\t £15 55S i-ii 1% ir, ?s; i{; sf: m «£? gfj 

5 10 15 

s;? j:s §ts ir^ js? ;s5 =51 1%% s"; jis sff s?$ 



30 



as ill ss| S5; ?ss E?s fjs sf; sjs jf= g;s jjs s;f 
ts? Ill s.- Iff sfs i:s III fis sfs iss sfs 155 === s;s 

60 

Sfs S|| Sfs iit ijf ;jf J- ni ?5; s?| sss sfs sfs 

ifS m fiS fIS Sf; 5?S iJS if$ g;j |== fcc ... „= 

ir, in S55 n; sfs S5s ifs in in {sVfjf sfs i« s?i 

105 no 

in m IS? f?s tjs ifs n? 55= ss? m tn tit m ?;? s« 

120 125 

?SS li? SfS iSS {IS SIf srj ISS i« SSS m £IS IJ-fS 

js^ SI? n£ sss sfs ?sf SI5 gj; i« ni ns ijs ijf 
?is sts it; £is m ?ss ;is §?5 ;«£ sfj ?ss jis is? 

" 170 

sss ?££ tr. tn III in tn m in in ?;? j;; sjs ijf 

185 

^IS ?CJ txl f^^ '^^^ AGC CTG GCA 

Tyr Aia Thr Ala Thr Leu Gin Arg Pro Gly Ser Leu Ala 

200 205 

tn in in tv, tn u; ss? ^n in tn in s;s nt in tn 
in III tn tnm rn in tn tn tn tn jii i nt ?is ?;i 

ifi fJS tn tn ?JI gJS j;S - ccj jjc ;cc CTC «c cj; «c 

its i?; n.i III m tn tn tn in h nt m tn nt ss" 

265 

?sj in ss5 ni li? ni in ni t^ m sif tn ut in tn ?£; 
rnrn tn tn in tn tn in tn mm tn nt iit tn ni 

300 

tn tsf tn tn in ni tn m m tn tn ni m tn 
ilint in in tn in sir tn m tn In nt t'A m in 

330 

tn in in tn tn tn in ?s£ in tn nt m tn m ni tn 

345 350 

nt rn in j*; a; tn in in ;k if; m in tn tn in in 

CCG GAA GTG ATT CAG ATG TTG CAG CAC CAG TTT CCC TCG GTC CAG TCT 



46 



94 



142 



190 



238 



286- 



334 



382 



430 



478 



526 



574 



622 



670 



718 



766 



814 



862 



910 



958 



1006 



1054 



1 102 



1150 
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Pro Glu 



AAC zee 
Asn Ala 
385 

AAA GCC 

Lvs Ala 
400 

TTG GAT 

Leu Asp 

AGA AAC 
Arg Asn 

AAA AAC 
Lys Asn 



Val 
370 

CCA 
Ala 



GAG 
Glu 



CAT 
His 



CTG 
Leu 



-100- 

He Gin Met Leu Gin His Gin Phe Pro 



ACT GAC 
Thr Asp 
465 

TCC TCA 
Ser Ser 
480 

GTA CTG 
Val Leu 



CCT CTT 
Pro Leu 



TGT 
Cys 
4S0 

CTG 
Leu 



TGC 
Cys 



ACC 
Thr 



GCC 
Ala 



ATA 
He 



CGG 
Arg 



GTG 
Val 
435 

GGT 
Gly 



TAC 
T y r 



AGG 
Arg 

ATG 
Met 
420 

TAT 
Ty r 

GGC 
Gly 



TTG 
Leu 



AGA 
Arg 
405 

ACC 
Thr 



GGG 
Gly 



CAA CAC CTC TGT TTT GGA 
Gin His Leu Cys Phe Gly 
390 39^ 

CAA GGA GGC ATC CAG CTC 
Gin Gly Gly He Gin Leu 
410 

GAA GTC CAC CGT ACT GCC 
Glu Val His Arg Ser Ala 
425 

AAG GCC AAC GAT GAT AAC 
Lys Ala Asn Asp Asp Asn 
440 



ATC 
lie 



GAG 
Glu 



GAT 
Asp 

AAC 
Asn 



ATC 
He 



GCA 
Ala 



CAG 
Gin 



CGT AAC 
Arg Asn 



GCC CGC 
Ala Arg 
545 

TAC GTG 
Tyr Val 
560 

GTT GAA 
Val Glu 



GCA GAA 
Ala Glu 



CTA CTC 
Leu Leu 



TGG GGC 
Trp Gly 
62 5 



GCC 
Ala 
530 

AGA 
Arg 



ATC 
He 



AAC 
Asn 



ACG 
Thr 



TGT 
Cys 
610 

AAG 
Lys 



GAT 
Asp 
515 

ACC 
Thr 



AGG 
Arg 



CAG 
Gin 



TGT 
Cys 



TCT 
Ser 
595 

GGC 
Gly 



GCG 
Ala 
500 

GAT 
Asp 



GGG 
Gly 



ATG 
Met 



TCT 
Ser 



GTG 
Val 
580 

CAG 
Gin 



GAG 
Glu 



GTA GGA 
Val Gly 
640 

CTG TGG 
Leu Trp 



TGC TCA 

Cys Ser 

TTG GCT 

Leu Ala 



CGA AAA 
Arg Lys 
705 

AAT GAC 
Asn Asp 
720 

TTG GAC 
Leu Asp 



CCT 
Pro 



CAC 
His 



AAT 
Asn 



GCA 
Ala 
690 

GAG 
Glu 



CGT 
Arg 

GTC 
Val 



AAG 
Lys 



CTT 
Leu 



CCA 
Pro 



CCA 
Pro 
675 

GGG 
Gly 

AAA 
Lys 

GTG 
Val 



AAG 
Lys 



CCA 
Pro 



TCA 
Ser 
660 

GAC 
Asp 



AGC 

Ser 



GGC 
Gly 

GTG 
Val 



AGA 
Arg 



AAT 
Asn 
740 



CGG 
Arg 



CTC 
Leu 
485 

GTG 
Val 



CGG 
Arg 



TGC 
Cys 



AGA 
Arg 



GCG 
Ala 
565 

TGC 
Cys 



GGA 
Gly 



GCC 
Ala 



AAG 
Lys 



GAC 
Asp 
645 

ATA 
He 



ACG 
Thr 



TGG 
Trp 



CTG 
Leu 



TGC 
Cys 
725 

AAG 
Lys 



CCA GCA CTG GTG AGG TTA 
Pro Ala Leu Val Arg Leu 
4 5 5 

GAG CTG GTC ACA GGA GTC 
Glu Leu Val Thr Gly Val 
470 

AAA ATG CCA ATC ATC CAG 
Lys Met Pro He He Gin 
490 

ATT ATC CCC CAC TCA GGC 
He He Pro His Ser Gly 
505 

AAA ATA CAG CTG CAT TCA 
Lys He Gin Leu His Ser 
520 

CTA AGG AAT GTT AGT TCG 
Leu Ar^ Asn Val Ser Ser 

GAG TGT GAT GGG CTT ACG 
Glu Cys Asp Gly Leu Thr 
550 555 

CTG GGG AGC AGT GAG ATC 

Leu Gly Ser Ser Glu He 
570 

ATT TTA AGG AAC CTC TCG 

He Leu Arg Asn Leu Ser 
585 

CAG CAC ATG GGC ACG GAC 

Gin His Met Gly Thr Asp 

600 ^ 

AAT GGC AAG GAT GCT GAG 

Asn Gly Lys Asp Ala Glu 
615 

AAA AAG AAA TCC CAA GAT 

Lys Lys Lys Ser Gin Asp 

630 635 

TGT GCT GAA CCA CCA AAA 

Cys Ala Glu Pro Pro Lys 
650 

GTC AAA CCC TAC CTC ACA 

Val Lys Pro Tyr Leu Thr 
665 

CTG GAA GGG GCG GCA GGC 

Leu Glu Gly Ala Ala Gly 

680 ^ 

AAG TGG TCA GTA TAT ATC 
Lys Trp Ser Val Tyr He 
695 

CCC ATC CTC GTG GAG CTG 
Pro He Leu Val Glu Leu 
710 715 

GCG GTG GCC ACT GCG CTG 
Ala Val Ala Thr Ala Leu 
730 

GAG CTC ATC GGC AAA TAC 
Glu Leu He Gly Lys Tyr 
7 4 5' 



Ser Val Gin Ser 
380 

GAC AAC AAA ATT 1198 

Asp Asn Lys He 

CTG GTG GAC CTG 1246 
Leu Val Asp Leu 
415 

TGT GGA GCT CTG 1294 
Cys Gly Ala Leu 
430 

AAA ATT GCC CTG 1342 
Lys He Ala Leu 
445 

CTC CGC AAG ACG • 1390 

Leu Arg Lys Thr 
460 

CTT TGG AAC CTC 143B 
Leu Trp Asn Leu 

GAT GCC CTA GCA I486 

Asp Ala Leu Ala 
495 

TGG GAA AAT TCG 1534 

Trp Glu Asn Ser 
510 

TCA CAG GTG CTG 1582 

Ser Gin Val Leu 
525 

GCC GGA GAG GAG 1630 

Ala Gly Glu Glu 
540 

GAT GCC TTG CTG 1678 

Asp Ala Leu Leu 

GAT AGC AAG ACC 1726 
Asp Ser Lys Thr 
575 

TAC CGG CTG GCG 1774 
Tyr Arg Leu Ala 
5-9 0 

GAG CTG GAC GGG 1822 
Glu Leu Asp Gly 
605 

AGC TCT GGG TGC 1870 
Ser Ser Gly Cys 
620 

CAG TGG GAT GGA 1918 
Gin Trp Asp Gly 



GGG ATC CAG ATG 1966 
Gly He Gin Met 
655 

CTG CTC TCT GAG 2014 
Leu Leu Ser Glu 
670 

GCC CTG CAG AAC 2062 
Ala Leu Gin Asn 
685 

CGA GCC GCT GTC . 2110 

Arg Ala Ala Val 
700 

CTC CGA ATA GAC 2158 
Leu Arg He Asp 



CGG AAC ATG GCC 2206 
Arg Asn Met Ala 
735 

GCC ATG CGA GAC 2254 
Ala Met Arq Asp 
756 
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r.i 'n'ii Ill 1^,1 m '.'s{^l''t' '.'"^ 

75S ^ ^ 7?S ^^'^ Ser Asn Asn Thr Ala Ser 

/bO 



EJi sfs sif If? sjs ?5? SI? ;s; s?: s:; - - 
it: sif tii ?sf J- ill tit j;s s;; |f; n; sj; 

t;s K ;if if$ fij - - cj. ... 
s?s isi 5f; sif si; j;= in sfj i;i gjs si; ;- jiJ 



8" 830 



jf^ iit ;;i ei^ - p. 
St: s?s j;i sit tn i;? „c ... 

in S5? J« IJf .CC CC. „C .JC ,CC CJC COC 

I'jl f?5 s:s its 1;- s;^ sj; tfc =c. jc, cc; cc= oj. .jc 



8 90 - 



ti; ts; £:s tr. ft: tst „c .=c ..c 

910 



2302 
2350 
2398 
2446 
2494 
254 2 
2590 
2638 



2734 



til ?;; sj; g?; sj. .„ ,cc „. 
ait ?SJ .;;i st: jj; ?n i?? ;n i;; js; us ;;i ss: isi 
m m r.i imti III JJ5 St; «s sts sn is; ?;; 
III s;: its its sss ts; ;;t £jt s;; 1{= {gc j.c c.c „. 

its it: is; ?5; j;^ ;;i |.; -c c.c 

985 



28 30 



2878 



3022 
3070 
31.18 



5t; sti s;s ;;; f;s s;s »s; its fj;;s; si; sf; sis ifj^fis t;; 
s;; ts; ir,„Jts ;?; sii «s f:;;;; is? s;i s;s s=| jsV?;; tsi 
it: Eis^Jts ;;i iti fs; tss^st; ?;; ;;s =;; is;J;s°sts i;; m 

JTj ,«.a.cc.o .occAcsc. CTccccG... c.=,.c.Tarcc.,=c.T.c 

10 4 0- ■ . J i / 1 

CACAAGACAT TTCTTTCTGT TTTGGTTTTT TTCTCrrrrs 

GTTrrn^.^^ ^ITTTT TTCTCCTGCA AATTTAGTTT GTTAAAGCCT 

GTTCCATAGG AAGGCTGTGA TAACCAGTAA GGGAAATATT AAGAGCTATT TTAGAAAGCT 

:zz::: — — — - 

' G g" G« """"" "-"-Cr TAGGAGTAAC GAGAAGTGCT TTATACGAA 

r CGAGKCGAGG CAXTCGGGCC GGTGGGGCGT AAGGGTTATC 

GTTAAGCACA AGACACAGAA TAGTTTACAC ACTGTGTGGG GGACGGCTTC TCACGCTTTG 

TTTACTCTCT TCATCCCTTG TGACTCTAGC CTTCAGGTTG CATTGGGOTT CCTC.G l 

CC..GA.GX. .C„GCC.„ XGX.AA.GCA „G..GXAAA C.A„.GA.G A 

ATTAAAGAAG NAAAGCGCCT TGTGTATATT ACACCAATNC CGCCGTGTTT CCXCATCXA. 

CGTTCTAAAT ATTGCTTCAA XTTCKAACTT TTGAAAGATG TATGCATTTC CAGTTTTTCX 



3231 

3291 

3351 

3411 

34 7 1 

3 531 

3591 

3651 

3711 

3 77 1 
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TTACTTTCTC CCAGTATGTT TTAACCNMMN AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



AAAACTCGAG 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Ser Gin Leu Pro Ala Arg Gly Thr Gin Ala Arg Xaa Thr Gly Gin Ser 
15 10 15 

Phe Ser Gin Gly Thr Thr Ser Arg Ala Gly His Leu Ala Gly Pro Glu 
20 25 30 

Pro Ala Pro Pro Pro Pro Pro Xaa Pro Arg Glu Pro Phe Ala Pro Ser 
35 40 45 

Leu Glv Ser Ala Phe His Leu Pro Asp Ala Pro Pro Ala Ala Ala Ala 
50 55 60 

Ala Ala Leu Tyr tyr Ser Xaa Ser Thr Leu Pro Ala Pro Pro Arg Gly 
65 70 75 80 

Gly Ser Pro Leu Ala Ala Pro Gin Gly Gly Ser Pro Thr Lys Leu Gin 
85 90 95 

Arg Gly Gly Ser Ala Pro Glu Gly Ala Thr Tyr Ala Ala Pro Arg Gly 
100 105 110 

Ser Ser Pro Lys Gin Ser Pro Ser Arg Leu Ala Lys Ser Tyr Ser Thr 
115 120 125 

Ser Ser Pro lie Asn lie Val Val Ser Ser Ala Gly Leu Ser Pro lie 
130 135 140 

Arg Val Thr Ser Pro Pro Thr Val Gin Ser Thr lie Ser Ser Ser Pro 
145 150 155 160 

lie His Gin Leu Ser Ser Thr He Gly Thr Tyr Ala Thr Leu Ser Pro 
165 170 175 

Thr Lys Arg Leu Val His Ala Ser Glu Gin Tyr Ser Lys His Ser Gin 
180 185 190 

Glu Leu Tyr Ala Thr Ala Thr Leu Gin Arg Pro Gly Ser Leu Ala Ala 
195 200 205 

Gly Ser Arg Ala Ser Tyr Ser Ser Gin His Gly His Leu Gly Pro Glu 
210 215 220 

Leu Arg Ala Leu Gin Ser Pro Glu His His He Asp Pro lie Tyr Glu 
225 230 235 240 

Asp. Arg Val Tyr Gin Lys Pro Pro Met Arg Ser Leu Ser Gin Ser Gin 
245 250 255 

Gly Asp Pro Leu Pro Pro Ala His Thr Gly Thr Tyr Arg Thr Ser Thr 
260 265 J ^ 270 

Ala Pro Ser Ser Pro Gly Val Asp Ser Val Pro Leu Gin Arg Thr Gly 
275 280 285 

Ser Gin His Gly Pro Gin Asn Ala Ala Ala Ala Thr Phe Gin Arg Ala 
290 295 300 

Ser Tyr Ala Ala Gly Pro Ala Ser Asn Tyr Ala Asp Pro Tyr Arg Gin 
305 310 315 320 

Leu Gin Tyr Cys Pro Ser Val Glu Ser Pro Tyr Ser Lys Ser Gly Pro 
325 330 335 

Ala Leu Pro Pro Glu Gly Thr Leu Ala Arg Ser Pro Ser He Asp Ser 
340 .345 350 

He Gin Lys Asp Pro Arg Glu Phe Gly Trp Arg Asp Pro Glu Leu Pro 
355 360 365 

Glu Val He Gin Met Leu Gin His Gin Phe Pro Ser Val Gin Ser Asn 
370 375 380 

Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn Lys He Lvs 
385 390 395 400 

Ala Glu He Arg Ara Gin Gly Gly He Gin Leu Leu Val Asp Leu Leu 




410 



415 
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Asp Hxs Arg Met Thr Glu Val His Arc Scr Ala Cys Gly Ala Leu Arg 
Asn Leu Val Tyr Gly Lys Ala Asn Asp Asp Asn Lys lie Ala Leu Lys 

Asn C^s Gly Gly He Pro Ala Leu Val Arg Leu Leu Arg Lys Thr Thr 

HDO 460 

AS^ Leu Glu He Arg Glu Leu Val Thr Gly Val Leu Trp Asn Leu Ser 

4 7 5 ^80 

ser cys Asp Ala Leu Lys Met Pro He lie Gin Asp Ala Leu Ala Val 

490 495 

Leu Thr Asn Ala Val He He Pro His Ser Gly Trp Glu Asn Ser Pro 

Leu Gin Asp Asp Arg Lys He Gin Leu His Ser Ser Gin Val Leu Arg 



Asn Ala Thr Gly Cys Leu Arc Asn Val Ser Ser Ala Gly Glu Glu Ala 

3 J 3 5 4 0 

Ar| Ar, Arg Met Arg Glu Cys Asp Gly Leu Thr Asp Ala Leu Leu Tyr 

555 560 

val He Gin Ser Ala Leu Gly Ser Ser Glu He Asp Ser Lys Thr Val 

-503 570 

Glu Asn cys Val Cys He Leu Arg Asn Leu Ser Tyr Arg Leu Ala Ala 
^o" 585 590 

Glu Thr Ser Gin Gly Gin His Met Gly Thr Asp Glu Leu Asp Gly Leu 

600 605 
Leu C^s Gly Glu Ala Asn Gl^ Lys Asp Ala Glu Ser Ser Gly Cys Trp 

Gl^ Lys Lys Lys Lys Lys Lys Lys Ser Gin Asd Gin Trp Asp Gly Val 

635 640 

Gly Pro Leu Pro Asp Cys Ala Glu Pro Pro Lys Gly He Gin Met Leu 

650 655 

Trp His Pro ser He Val Lys Pro Tyr Leu Thr Leu Leu Ser Glu Cys 

665 670 

Ser Asn Pro Asp Thr Leu Glu Glv Ala Ala Gly Ala Leu Gin Asn Leu 

bO u 58 5 

Ala Ala Gly Ser Trp Lys Tr| Ser Val Tyr He Arg Ala Ala Val Arg 

L^l Glu Lys Gly Leu Pro He Leu Val Glu Leu Leu Arg He Asp Asn 

715 720 

Asp Arg val Val Cvs Ala Val Ala Thr Ala Leu Arg Asn Met Ala Leu 

730 735 

Asp Val Arg Asn Lys Glu Leu He Gl^ Lys Tyr Ala Met Arg Asp Leu 

Val His Arg Leu Pro Gly Gly Asn Asn Ser Asn Asn Thr Aia Ser Lys 
'^^ 760 765 

Ala Met Ser Asp Asp Thr Val Thr Ala Val Cys C^s Thr Leu His Glu 

val He Thr Lys Asn Met Glu Asn Ala Lys Ala Leu Arg Asp Ala Gly 

800 

Gly He Glu Lys Leu Val Gly He Ser Lj-s Ser Lys Gly Asp Lys His 

ser Pro Lys Val Val Lys Ala Ala Ser Gin Val Leu Asn Ser Met Trp 

825 830 
Gin Tyr Arg Asp Leu Arg Ser Leu Tyr Lys Lys Asp Gl^ Trp Ser Gin 

Tyr His Phe Val Ala Ser Ser Ser Thr He Glu Arg Asp Arg Gin Arg 

Pro Tyr Ser Ser Ser Aro Thr Pro Ser He Ser Pro Val Arg Val Ser 

880 

pro Asn Asn Arg Ser Ala Ser Ala Pro Ala Ser Pro Arg Glu Met He 

. o 90 8 95 

ser Leu Lys Glu Arg Lys Thr Asp Tvr Glu Cys Thr Gly Ser Asn Ala 

905 910 
Thr Tyr His Gly Gly Lys Gly Glu His Thr Ser Arg L^s Asp Ala Met 
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Thr Ala Gin Asn Thr Gly lie Ser Thr Leu Tyr Arg Asn Ser Tyr Gly 

Ala. Pro Ala Glu Asp lie Lys His Asn Gin Val Ser Ala Gin Pro Val 

955 960 

Pro Gin Glu Pro Ser Arg Lys Asp Tyr Glu Thr Tyr Gin Pro Phe Gin- 
Soi 970 

Asn Ser Thr Ara Asn Tyr Asp Glu Ser Phe Phe Glu Asp Gin Val His 
ysu 985 990 

His Arg Pro Pro Ala Ser Glu Tvr Thr Met His Leu Gly Leu Lys Ser 

1000 1005 

^^"^ ?n^fn^^" '^^'^ '^^P fU^ "^y^ ^^"^ Arg Pro Tyr Ser Glu 

iU-*-^ 1015 1020 

Leu Asn Tyr Glu Thr fer His Tyr Pro Ala Ser Pro Asp Ser Trp Val 
^°25 1030 1035 ^ 1040 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 142. .3777 

(D) OTHER INFORMATION: /note« "P0071- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CTACTGTTGT TTTTGAGGGG CGGGCAGCCG CGCCGCCGCG GCACTTTTTT AATTTTTTCG 60 

GGTGCCGCAG CAGCGACCCC TCGGCGCCGA TGTCCCTGAT CCCTGGAGCG ACGACGGCCG 120 

CTGCCTAAGC TGGGAAGAGG A ATG CCA GCT CCT GAG CAG GCC TCA TTG GTG 17 1 

Met Pro Ala Pro Glu Gin Ala Ser Leu Val 

1 . 5 10 

GAG GAG GGG CAA CCA CAG ACC CGC CAG GAA GCT GCC TCC ACT GGC CCA 219 
Glu Glu Gly Gin Pro Gin Thr Arg Gin Glu Ala Ala Ser Thr Gly Pro 
15 20 . 25 

r?^ 2*^2 ACC ACT ATT CTA GCA TCC -GTG AAG 267 

Gly Met Glu Pro Glu Thr Thr Ala Thr Thr lie Leu Ala Ser Val Lys 
30 35 40 

r^?. rt^ f"^"^ 13"^ ^'^^ ACC CGA GAA CTG GAA GTG GAA 315 

Glu Gin Glu Leu Gin Phe Gin Arg Leu Thr Arg Glu Leu Glu Val Glu 
45 50 55 

AGG CAG ATT GTT GCC AGT CAG CTA GAA AGA TGT AGG CTT GGA GCA GAA 363 

60 ^-5 ^'^y 



31n Leu Glu Arg Cys Arg Leu Gly Ala Glu 
65 70 



TCA CCA AGC ATC GCC AGC ACC AGC TCA ACT GAG AAG TCA TTT CCT TGG 411 

Ser Pro Ser lie Ala Ser Thr Ser Ser Thr Glu Lys Ser Phe Pro Trp 
^5 80 85 9b 

AGA TCA ACA GAC GTG CCA AAT ACT GGT GTA AGC AAA CCT AGA GTT TCT 

Arg Ser Thr Asp Val Pro Asn Thr Gly Val Ser Lys Pro Arg Val Ser 

95 100 105 



160 165 110 

AAG GCA GAC AAC AGA CAG CAG CAT TCA TTC ATA GGA TCA ACT AAC AAC 

Lys Ala Asp Asn Ara Gin Gin His Ser Phe lie Gly Ser Thr Asn Asn 

1'5 180 , 185 

CAT GTG GTG AGG AAT TCA AGA GCT GAA GGA CAA ACA CTG GTT CAG CCA 

His Val Val Arg Asn Ser Arg Ala Glu Gly Gin Thr Leu Val Gin Pro 



.459 



507 



icS w^? T^'^ ^'^^ ATC AGG ACA GAG CCA • GAA CAA 

Asp Ala Val Gin Pro Asn Asn Tyr Leu He Arg Thr Glu Pro Glu Gin 

110 115 120 

r?S tkS l^^ l^^ TCT CTC CAT GAA AGT GAG GGA 555 

Gly Thr Leu Tyr Ser Pro Glu Gin Thr Ser Leu His Glu Ser Glu Gly 

125 130 135 ^ 

TCA TTG GGT AAC TCA AGA AGT TCA ACA CAA ATG AAT TCT TAT TCC GAC 603 

Ser Leu Gly Asn Ser Arg Ser Ser Thr Gin Met Asn Ser Tyr Ser Asp 

140 145 

^21 r^'^ l^^ S^'A GCA GGG AGT TTC CAC AAC AGC CAG AAC GTG AGC 651 

Ser Gly Tyr Gin Glu Ala Gly Ser Phe His Asn Ser Gin Asn Val Ser 

160 ICC ^ ^ 



699 



747 
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TCA GTA 
Ser Val 



GCA CAG 
Ala Gin 
220 

GGG TCT 
Gly Ser 
235 

ACC GAC 
Thr Asp 



GCC 
Ala 
205 

TCT 
Ser 



190 

AAT CGG GCC ATG 
Asn Arg Ala Met 



CTG 
Leu 



CCC 
Pro 



CCT GCT 
Pro Ala 



ACA GCT 
Thr Ala 



CCC AAC 
Pro Asn 
300 

CCA CTG 
Pro Leu 
.315 

GGC CAG 
Gly Gin 

CCA CAG 
Pro Gin 



GCA 
Ala 



ATA 
lie 
285 

GGA 
Gly 



CCT TCT TAT GTT 
Pro Ser Tyr Val 
225 

AGA ACT TCT CTG 
Arg Thr Ser Leu 
240 

CGA CCT CTG AAC 
Arg Pro Leu Asn 
25 5 

CGG GCA GCC TCT 
Arg Ala Ala Ser 
270 

CGG CGG ATT GGG 
Arg Arg He Gly 



-105- 

195 

AGA AGA GTT AGT TCA GTT 
Arq Arg Val Ser Ser Val 
210 215 

ATC AGC ACA GGC GTG TCT 
He Ser Thr Gly Val Ser 
230 

GGT AGT GGA TTT GGC TCT 
Gly Ser Gly Phe Gly Ser 
245 

CCC AGT GCA TAT TCC TCC 
Pro Ser Ala Tyr Ser Ser 
260 



ACC 
Thr 



GTG 
Val 



CAT 
His 



CAA TTC 
Gin Phe 



AGG CCA 
Arg Pro 
380 

AGT CAG 
Ser Gin 
395 

ATT ACT 
He Thr 



GGA 
Gly 
365 

GAC 
Asp 



CTT 
Leu 



CCT 
Pro 



CCA ACC CCT CAA 
Pro Thr Pro Gin 
305 

CTG ACG GAT GCA 
Leu Thr Asp Ala 
320 

GGG TCG TCG TCC 
Gly Ser Ser Ser 
335 

CTG GGA CCT TCA 
Leu Gly Pro Ser 
350 

CAG CAG CAG TAT 
Gin Gin Gin Tyr 



CCG TAC TCA CAG AGA 
Pro Tyr Ser Gin Arg 
275 ^ 

GTC ACC TCC CGG 
Val Thr Ser Arg 

CAA ACC ACC GCC 
Gin Thr Thr Ala 
310 

ACT CGA GTA GCT 
Thr Arg Val Ala 
325 

AAA CGC TCA GGG 
Lys Arg Ser Gly 
340 

CAA AGG ACT GTT 
Gin Arg Thr Val 
355 

ATT TAT GAG AGG 
He Tyr Glu Arg 



TCA 
Ser 
290 

TAC. 
Tyr 



CAG 
Gin 



CCC 
Pro 



CTG 
Leu 



CCC 
Pro 



CAG 
Gin 
295 

AGA 
Arg 

TCC 
Ser 



200 

CCA TCT AGA 
Pro Ser Arg 

CCT TCA AGG 
Pro Ser Arg 

CCG TCA GTG 
Pro Ser Val 
250 

ACC ACA TTA 
Thr Thr Leu 
265 

GCC TCC CCA 
Ala Ser Pro 
280 

ACC TCC AAT 
Thr Ser Asn 



ATG 
Met 



CAT 
His 



AGC 
Ser 



GGG 
Gly 



ATA 
He 



AGC CCA 
Ser Pro 



TAT CGC 
Tyr Arg 

CAA CGA 
Gin Arg 
460 

ACA GCT 
Thr Ala 
475 

GAG TGC 
Glu Cy s 



AAC 
Asn 



ACA 
Thr 
445 

AGT 
Ser 



CAT 
His 
430 

GGT 
Gly 



CTG ACA GGC 
Leu Thr Gly 
385 

CAA GAC CTT 
Gin Asp Leu 
.400 

TAT GAG GGG 
Tyr Glu Gly 
415 

GGA ACT GTG 
Gly Thr Val 



GAC 
Asp 
370 

TTA 
Leu 



CGT 
Arg 

AGG 
Arg 

GAG 
Glu 



CGG AGT TCC TAT 
Arg Ser Ser Tyr 
390 

TCT GCC GTG TCT 
Ser Ala Val Ser 
405 

ACC TAT TAC AGC 
Thr Tyr Tyr Ser 



Tyr 
420 



ATG 
Met 
375 

GCT 
Ala 



CCC 
Pro 



CCA 
Pro 



GTG GGG TCC 
Val Gly Ser 

CCA TCC CAA 
Pro Ser Gin 
330 

ACC GCC GTA 
Thr Ala Val 
345 

GAC ATG GAG 
Asp Met Glu 
360 

GTT CCA CCC 
Val Pro Pro 



AGT CAG CAT 
Ser Gin His 



GAC TTG CAC 
Asp Leu His 
410 

GTG TAC CGC 
Val Tyr Arg 



Tyr 
425 



CTC CAA GGA TCG 
Leu Gin Gly Ser 
435 ' 



GTA TCA GGT 
Val Ser Gly 



ACC ACA 
Thr Thr 



TTT GCC 
Phe Ala 



CAC CAG 
His Gin 
540 

CTG TGC 
Leu Cys 
555 



ACC CTT ACA TAC 
Thr Leu Thr Tyr 
465 

ACC TAC GCG GAG CCC 
Thr Tyr Ala Glu Pro 
480 

TAT AAC AGG CTT 
Tyr Asn Arg Leu 
495 

TCC CCA TCA ATA 
Ser Pro Ser He 
510 

CGT GAT CCT GAG 
Arg Asp Pro Glu 



AAT 
Asn 



AGA 
Arg 



TGG 
Trp 
525 

TTC 
Phe 



TTT 
Phe 



CCA TCT GTT CAG 
Pro Ser Val Gin 
545 

GGT GAC AAC AAA 
Gly Asp Asn Lys 
560 



ATT GGA AAT CTA CAA 
He Gly Asn Leu Gin 
450 . 

CAA AGA AAT AAT TAT 
Gin Arg Asn Asn Tyr 
470 

TAC' AGG CCT ATA CAA 
Tyr Arg Pro He Gin 
485 

CAG CAT GCA GTG CCG 
Gin His Ala Val Pro 
500 

GAC AGC ATT CAG AAG 
Asp Ser He Gin Lys 
515 ^ 

TTG CCT GAG GTC ATT 
Leu Pro Glu Val He 
530 

GCA AAT GCA GCG GCC 
Ala Asn Ala Ala Ala 
550 

GTG AAG ATG GAG GTG 
Val Lys Met Glu Val 
565 



CAG 
Gin 



AGG 
Arg 
455 

GCT 
Ala 



ACG GCG TTG 

Thr Ala Leu 
440 

ACA TCC AGC 

Thr Ser Ser 



CTG AAC ACA 
Leu Asn Thr 



TAC 
Tyr 

GCT 
Ala 



GAC 
Asp 



CAC 
His 
535 

TAC 
Tyr 



CGA GTG CAA 
Arg Val Gin 
490 

GAT GAT GGC 
Asp Asp Glv 
505. 

CCC AGG GAG 

Pro Arg Glu 
520 

ATG CTT GAG 

Met Leu Glu 



CTG CAG CAC 
Leu Gin His 



TGT 

Cys 



GGA ATC AAG CAT CTG GTT GAC CTT CTG GAC CAC AGA GTT 



AGG TTA GGG 
Arg Leu Gly 
576 

TTG GAA GTT 



795 
843 
891 
939 
987 
1035 
1083 
1131 
1179 
1227 
1275 
1323 
1371 
1419 
1467 
1515 
1563 
1611 
1659 
1707 
1755 
1803 
1851 
1899 
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Gly lie Lys His 



CAG AAG AAT 
Gin Lys Asn. 



ACA GAT GAA 
Thr Asp Glu 
605 

TTG TTG CGA 
Leu Leu Ar g 
620 

GTT ACA GGA 
Val Thr Gly 
635 

ACA ATC ATT 
Thr lie He 



GCT 
Ala 
590 

AAT 
Asn 



CTG 
Leu 



GTT 
Val 



CGA 
Arg 



CCA CAT TCT 
Pro His Ser 



AAA TTT CAG 
Lys Phe Gin 
685 

AAC CTC ACG 
Asn Leu Thr 
700 

GAG GGG CTG 
Glu Gly Leu 
715 

ACA TCC GAT 
Thr Ser Asp 

AGG AAC CTG 
Arg Asn Leu 

CTG GGA CTG 
Leu Gly Leu 
765 

AAA GAC TCT 
Lys Asp Ser 
780 

AGG ACT CCG 
Arg Thr Pro 
795 

CTG TCG AAG 
Leu Ser Lys 



GGA 
Gly 
670 

ACT 
Thr 



TCC 
Ser 



GTA 
Val 



TAG 
Ty r 

TCC 
Ser 
750 

AAC 
Asn 



GAG 
Glu 



CAA 
Gin 



-106- 

Leu Val Asp Leu Leu Asp His Arg Val Leu Glu Val 
575 580 585 

TGT GGT GCC CTT CGA AAC CTC GTT TTT GGC AAG TCT 
Cys Gly Ala Leu Arg Asn Leu Val Phe Gly Lys Ser 
595 606. 

AAA ATA GCA ATG AAG AAT GTT GGT GGG ATA CCT GCC 
Lys He Ala Met Lys Asn Val Gly Gly He Pro Ala 
610 615 

TTG AGA AAA TCT ATT GAT GCA GAA GTA AGG GAG CTT 
Leu Arg Lvs Ser He Asp Ala Glu Val Arg Glu Leu 
625 630 ' 

CTT TGG AAT TTA TCC TCA TGT GAT GCT GTA- AAA ATG 
Leu Trp Asn Leu Ser Ser Cys Asp Ala Val Lys Met 

645 650 

GAT GCT CTC TCA ACC TTA ACA AAC ACT GTG ATT GTT 
Asp Ala Leu Ser Thr Leu Thr Asn. Thr Val He Val 
655 660 665 

TGG AAT AAC TCT TCT TTT GAT GAT GAT CAT AAA ATT 
Trp Asn Asn Ser Ser Phe Asp Asp Asp His Lys He 
675 680 

TCA CTA GTT CTG CGT AAC ACG ACA GGT TGC CTA AGG 
Ser Leu Val Leu Arg Asn Thr Thr Gly Cys Leu Ara 
690 695 

GCG GGG GAA GAA GCt CGG AAG CAA ATG CGG TCC TGC 
Ala Gly Glu Glu Ala. Arg Lys Gin Met Arg Ser Cys 
705 710 

GAC TCA CTG TTG TAT GTG ATC CAC ACG TGT GTG AAC 
Asp Ser Leu Leu Tyr Val He His Thr Cys Val Asn 
^20 725 730 

GAC AGC AAG ACG GTG GAG AAC TGC GTG TGC ACC CTG 
Asp Ser Lys Thr Val Glu Asn Cys Val Cys Thr Leu 
7 35 740 745 

TAT CGG CTG GAG CTG GAG GTG CCC CAG GCC CGG TTA 
Tyr Arg Leu Glu Leu Glu Val Pro Gin Ala Arg Leu 
755 760 

GAA TTG GAT GAC TTA CTA GGA AAA GAG TCT CCC AGC 
Glu Leu Asp Asp Leu Leu Gly Lys Glu Ser Pro Ser 
770 775 

CCA AGT TGC TGG GGG AAG AAG AAG AAA AAG AAA AAG 
Pro Ser Cgs Trp Gly Lys Lys L^s Lys Lys Lys Lys 

GAA GAT CAA TGG GAT GGA GTT GGT CCT ATC CCA GGA 
Glu Asp Gin Trp Asp Gly Val Gly Pro He Pro Gly 
flnn gj^j 



805 



TCC 
Ser 



GTA AAA CCA 
Val Lys Pro 

TTG GAA GGC 
Leu Glu Gly 
845 

AAG TTT GCA 
Lys Phe Ala 
660 

CCC ATC CTT 
Pro He Leu 
875 

TCC GGT GCA 
Ser Gly Ala 



GAG CTC ATA 
Glu Leu He 



GGC GGC AAT 
Gly Gly Asn 
925 

TGC TGT GCT 
Cys Cys Ala 
940 



TAT 
Tyr 
630 

TCT 
Ser 



GCA 
Ala 



GTG 
Val 



ACA 
Thr 



GGC 
Gl y 
910 

GGC 
Gly 



CTG 
Leu 



CCC AAA GGG GTT GAG ATG CTG TGG CAC CCA TCG GTG 

Pro Lys Gly Val Glu Met Leu Trp His Pro Ser Val 

015 820 825 

CTG ACT CTT CTA GCA GAA AGT TCC AAC CCA GCC ACC 

Leu Thr Leu Leu Ala Glu Ser Ser Asn Pro Ala Thr 

835 840 

GCA GGG TCT CTC CAG AAC CTC TCT GCT AGC AAC TGG 

Ala Gly Ser Leu Gin Asn Leu Ser Ala Ser Asn Trp 
850 855 

TAT ATC CGG GGC GGC CGT CCG AAA AGA AAA GGG CTC 

Tyr He Arg Gly Gly Arg Pro Lys Arg Lys Gly Leu 
865 870 

GAG CTT CTG AGA ATG GAT AAC GAT AGA GTT GTT TCT 

Glu Leu Leu Arg Met Asp Asn Asp Arg Val Val Ser 

880 885 890 

GCC TTG AGG AAT ATG GCA CTA GAT GTT. CGC AAC AAG 

Ala Leu Arg Asn Met Ala Leu Asp Val Arg Asn Lys 

895 900 905 

AAA TAC GCC ATG CGA GAC CTG GTC AAC CGG CTC CCC 

Lys Tyr Ala Met Arg Asp Leu Val Asn Arg Leu Pro 

915 920 

CCC AGT GTC TTG TCT GAT- GAG ACC ATG GCA GCC ATC 

Pro Ser Val Leu Ser Asp Glu Thr Met Ala Ala He 
930 935 

CAC GAG GTC ACC AGC AAA AAC ATG GAG AAC GCA AAA 

His Glu Val Thr Ser Lys Asn Met Glu Asn Ala Lvs 
945 950 



1947 



1 995- 



204 3 



2091 



2139 



2187 



2235 



2283 



233 1 



2 37 9 



2427 



2 475 



2523 



2571 



2619 



2667 



2715 



2763 



2811 



28 59 



2907 



2955 



3003 
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III iit ssf is; ill "s "s "° ?S5 1:; 

965 

ifS 5?S S5 tli ^1 i;i s^f sif 

995 

SI? .^^^ ^tc ccc acc „t t.t ... 

990 ^ Q$? Acg Ser He Tyr Lys 

1000 



ifs s^|;!S Sf5 ISJ^S?: is; if? n; ISI^?SS ;ss 

k; ess !is is; sis if? sss i^s'isi iss is; 5s; 

^^'^S 1050 

s;^ £i; n: §f; ;rs;s5 s;s ffi m isi^it: ?js j;; ;?s -ss at 

1060 1065 

15; r.i s:f ij; -s if. cc. .c. ... ..c 

Hi Vyi fs:^ifs ISS sss tv. f?Ms; ss; ni j;s ns^tsMss 
iJf j|f„ss; sfj .-s; gji s;;^;;: ;s; sfs s:; i;sj;Ts;s sts £:s 
tsF ijs Kjj'jss jj; ts; ;;s ;n i;; s - ;s| 
ns ;?? IIS is|^ssi sti sss ?;i intn ss; ii: s;jj;s 
Jf; sn Sfs III is; ss: isi ?si s- ;;s is; ?s; gfs i;; lir^is 

ij; i£f ?SS^?S; ?;i SIf j;i iss jj-s; cc, 

??; S5g7s; s?: ifs ?;s fss^is? iss ss; tn vi^l^^ii j;s s;; 
If;;;; sj; i;s ;s|^!;s isi iis in si: ins us ss; iis 

1205 1210 
AGG TGA AAAGTCCATC TTGCTGATTT CATGATTGAA ATGTCAAAGT GAAGTGGAAG 



305 1 
3099 
3147 
. 3195 
324 3 
3291 
3339 
3387 
34 35 
3483 
3531 
3579 
3627 
3675 
3723 
3771 
3827 



GAATGAATGA AGTGTGTTTT TTTTTCCTTT TTGAGGAATT ATCAGGGGAA TTCGATATCA 
AGCTTATCGA TACCGTCGAC 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS- 

(HI TYPE: amino acid 
(D) TOPOLOGY: linear 

(iil MOLECULE TYPE: protein 

(Xij SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Pro Ala. Pro Glu Cln Ala Ser Leu Val Glu Glu Gly Gin Pro Gin 



15 



Thr Arg Gin Clu Ala Ala Ser T.r Glv Pro Gly Met Glu Pro Glu The 

^ ^ 30 . 

Th. .xa r.r rnr Xle Leu Ala Ser Val .y. ciu Gin Clu r.eu Cln Phe 

Cln A.g Leu Thr Ar, Clu Leu Glu Val Glu .r. Gin val Ala Ser 

6 0 



3887 
3907 
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Gin Leu Glu Arg Cys Arg Leu Gly Ala Glu Ser pro Ser lie Ala Ser 
65 70 75 80 

Thr Ser Ser Thr Glu Lys Ser Phe Pro Trp Arg Ser Thr Asp Val Pro 
85 90 9S 

Asn Thr Gly Val Ser Lys Pro Arg Val Ser Asp Ala Val Gin Pro Asn 
100 105 110 

Asn Tyr Leu He Arg Thr Glu Pro Glu Gin Gly Thr Leu Tyr Ser Pro 
115 120 125 

Glu Gin Thr Ser Leu His Glu Ser Glu Gly Ser Leu Gly Asn Ser Aro 
130 . 135 140 

Ser Ser Thr Gin Met Asn Ser Tyr Ser Asp Ser Gly Tyr Gin Glu Ala 
145 .150 155 160 

Gly Ser Phe His Asn Ser Gin Asn Val Ser Lys Ala Asp Asn Arg Gin 
165 170 175 

Gin His Ser Phe He Gly Ser Thr Asn Asn His Val Val Arg Asn Ser 
180 185 190 

Arg Ala Glu Gly Gin Thr Leu Val Gin Pro Ser Val Ala Asn Arg Ala 
195 200 205 

Met Arg Arg Val Ser Ser Val Pro Ser Arg Ala Gin Ser Pro Ser Tyr 
210 215 220 

Val He Ser Thr Gly Val Ser Pro Ser Arg Gly Ser Leu Arg Thr Ser 
225 230 235 240 

Leu Gly Ser Gly Phe Gly Ser Pro Ser Val Thr Asp Pro Arg Pro Leu 
245 250 255 

Asn Pro Ser Ala Tyr Ser Ser Thr Thr Leu Pro Ala Ala Arg Ala Ala 
260 265 270 

Ser Pro Tyr Ser Gin Arg Pro Ala Ser Pro Thr Ala He Arg Arg lie 
275 2B0 285 . 

Gly Ser Val Thr Ser Arg Gin Thr Ser Asn Pro Asn Gly Pro Thr Pro 
290 295 300 

Gin Tyr Gin Thr Thr Ala Arg Val Gly Ser Pro Leu Thr Leu Thr Asp 
305 310 315 320 

Ala Gin Thr Arg val Ala Ser Pro Ser Gin Gly Gin Val Gly Ser Ser 
325 330 . 335 

Ser Pro Lys Arg Ser Gly Met Thr Ala Val Pro Gin His Leu Gly Pro 
340 345 350 

Ser Leu Gin Arg Thr Val His Asp Met Glu Gin Phe Gly Gin Gin Gin 
355 360 365 

Tyr Asp He Tyr Glu Arg Met Val Pro Pro Arg Pro Asp Ser Leu Thr 
375 375 380 

Gly Leu Arg Ser Ser Tyr Ala Ser Gin His Ser Gin Leu Gly Gin Asp 
385 390 395 400 

Leu Arg Ser Ala Val Ser Pro Asp Leu His He Thr Pro He Tyr Glu 
4 0 5 410 4 15 

Gly Arg Thr Tyr Tyr Ser Pro Val Tyr Arg Ser Pro Asn His Gly Thr 
420 425 430 

Val Glu Leu Gin Gly Ser Gin Thr Ala Leu Tyr Arg Thr Gly Val Ser 
435 440 445 

Gly He Gly Asn Leu Gin Arg Thr Ser Ser Gin Arg Ser Thr Leu Thr 
450 455 460 

Tyr Gin Arg Asn Asn Tyr Ala Leu Asn Thr Thr Ala Thr Tyr Ala Glu 
465 470 475 480 

Pro Tyr Arg Pro He Gin Tyr Arg Val Gin Glu Cys Asn Tyr Asn Arg 
485 490 495 

Leu Gin His Ala Val Pro Ala Asp Asp Gly Thr Thr Arg Ser Pro Ser 
500 505 510 

He Asp Ser He Gin Lys Asp Pro Arg Glu Phe Ala Trp Arg Asp Pro 
515 520 525 

Glu Leu Pro Glu Val He His Met Leu Glu His Gin Phe Pro Ser Val 
530 535 . 540 

Gin Ala Asn Ala Ala Ala Tyr Leu Gin His Leu Cys Phe Gly Asp Asn 
545 550 555 560 

Lys Val Lys Met Glu Val Cys Arg Leu Gly Gly He Lys His Leu Val 
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565 570 575 

Asp Leu Leu Asp His Arg VaX Leu Glu Val Gin Lys Asn Ala Cys Gly 
3o0 585 590 

Ala Leu Arg Asn Leu Val Phe Glv Lys Ser Thr Asp Glu Asn Lys lie 

600 605 

.610 ^^'^ ^^"^ Arg 

LvE Ser lie Asp Ala Glu. Val Arg Glu Leu Val Thr Gly Val Leu Trp 

oJO 635 640 

Asn Leu Ser Ser Cys Asp Ala Val Lys Met Thr He He Arg Asp Ala 

650 655 

Leu Ser Thr Leu Thr Asn Thr Val lie Val* Pro His Ser Gly Trp Asn 

665 670 

Asn Ser Ser Phe Asp Asp Asp His Lys He Lys Phe Gin Thr Ser Leu 

680 685 

Val Leu Arg Asn Thr Thr Gly Cys Leu Arg Asn Leu Thr Ser Ala Gly 

695 700 

Glu Glu Ala Arg Lys Gin Met Arg Ser- Cys Glu Gly Leu Val Asp Ser 

-^10 715 . 720 

Leu Leu Tyr Val lie His Thr Cys Val Asn Thr Ser Asp Tyr Asp Ser 
'25 730 735 

Lys Thr Val Glu Asn Cys Val Cys Thr Leu Arg Asn Leu Ser Tyr Arg 
740 745 750 

Leu Glu Leu Glu Val Pro Gin Ala Arg Leu Leu Gly Leu Asn Glu Leu 
755 760 765 

Asp Asp Leu Leu Gly Lys Glu Ser Pro Ser Lys Asp Ser Glu Pro Ser 
770 775 780 

C^s Trp Gly Lys Lys L^s Lys Lys Lys Lys Ar| Thr Pro Gin Glu Asg 

Gin Trp Asp Gly Val Gly Pro He Pro Gly Leu Ser Lys Ser Pro Lys 
805 810 815 

Gly Val Glu Met Leu Trp His Pro Ser Val Val Lys Pro Tyr Leu Thr 
o20 825 830 

Leu Leu Ala Glu Ser Ser Asn Pro Ala Thr Leu Glu Gly Ser Ala Gly 
835 B40 845 

Ser Leu Gin Asn Leu Ser Ala Ser Asn Trp Lys Phe Ala Ala Tyr He 

855 860 

Arc Gly Gly Arg Pro Lys Arg Lys Gly Leu Pro He Leu Val Glu Leu 

875 880 

Leu Arg Met Asp Asn Asp Arg Val Val Ser . Ser Gly Ala Thr Ala Leu 
885 890 895 

Arg Asn Met Ala Leu Asp Val Arg Asn Lys Glu Leu He Gly Lys Tyr 
sou 905 9i5 ' 

Ala Met Arq Asp Leu Val Asn Arg Leu Pro Gly Gly Asn Gly Pro Ser 

920 92 5 

val Leu Ser Asp Glu Thr Met Ala Ala He Cys Cys Ala Leu His Glu 

935 940 

Val Thr Ser Lys Asn Met Glu Asn Ala Lys Ala Leu Ala Asp Ser Gly 

950 955 ggg 

Gly He Glu Lys Leu Val Asn He Thr Lys Gly Arg Gly Asp Arg Ser 
yoo 970 975 

Ser Leu Lys Val Val Lys Ala Ala Ala Gin Val Leu Asn Thr Leu Trp 
980 985 99Q 

Gin Tyr Aro Asp Leu Arg Ser lie Tyr Lys LysAsp Glv Trp Asn Gin. 

1000 1005 

Asn His^Phe He Thr Pro Val^Ser Thr Leu Glu Arg^Asp Arg Phe Lys 

Ser^His Pro Ser Leu Ser Thr Thr Asn Gin Gin Met Ser Pro He He 

1035 1040 
Gin ser Val Gly Ser^Thr Ser Ser Ser Pjo^Ala Leu Leu Gly Ile^Arg 

Asp Pro Arg Ser^Glu Tyr Asp Arg Thr Gin Pro Pro Met Gin Tyr Tyr 



1070 
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Asn Ser Gin Gly Asp Ala Thr His Lys Gly Leu Tyr Pro Gly Ser Ser 
^ " 1080 1085 

rS?o'" \oU^"" tVoo^''' ""^^ ''^^ ^'^^ 

Gin Asn Arg Arg Leu Gin His Gin Gin Leu Tyr Tyr Ser Gin Asp Asp 

Ser Asn Arg Lys Asn Phe Asp Ala Tyr Arg Leu Tyr Leu Gin Ser Pro 
1125 1130. 1135 

His Ser. Tyr Glu Asp Pro Tyr Phe Asp Asp Arg Val His Phe Pro Ala 
AiflU 1145. 1150 

Ser Thr Asp Tyr Ser Thr Gin. Tyr Gly Leu Lys Ser Thr Thr Asn Tyr 
1-133 1160 1165 

Val As^^Phe Tyr Ser Thr L^s^Arg Pro Ser Tyr Arg^Ala Glu Gin Tyr 

Pro Gly ser Pro Asp Ser Trp Val Tyr Asp Gin Asp Ala Gin Gin Arg 
^^.^^ 11^<> 1195 1260 

Asn Ser Phe Phe Leu Thr Leu Phe Arg Leu Arq 
1205 .1210 

(2) INFORMATION FOR SEQ ID N0:7: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 
to STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ix) FEATURE: 

{A) NAME/KEY: misc feature 
(B) LOCATION: 1 . . 970 

(D) OTHER INFORMATION: /note- "Y2H9*' 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 



GAATTCCCAC 


AGATACCACT 


GCTGCTCCCG 


CCCTTTCGCT 


CCTCGGCCGC 


GCAATGGGCA 


60 


CCCGCGACGA 


CGAGTACGAC 


TACCTCTTTA 


AAGTTGTCCT 


TATTGGAGAT 


TCTGGTGTTG 


120 


GAAAGAGTAA 


TCTCCTGTCT 


CGATTTACTC 


GAAATGAGTT 


TAATCTGGAA 


AGCAAGAGCA 


160 


CCATTGGAGT 


AGAGTTTGCA 


ACAAGAAGCA 


TCCAGGTTGA 


TGGAAAAACA 


ATAAAGGCAC 


240 


AGATATGGGA 


CACAGCAGGG 


CAAGAGCGAT 


ATCGAGCTAT 


AACATCAGCA 


TATTATCGTG 


300 


GAGCTGTAGG 


TGCCTTATTG 


GTTTATGACA 


TTGCTAAACA 


TCTCACATAT 


GAAAATGTAG 


3 60 


AGCGATGGCT 


GAAAGAACTG 


AGAGATCATG 


CTGATAGTAA 


CATTGTTATC 


ATGCTTGTGG. 


420 


GCAATAAGAG 


TGATCTACGT 


CATCTCAGGG 


CAGTTCCTAC 


AGATGAAGCA 


AGAGCTTTTG 


480 


CAGAAAAGAA 


TGGTTTGTCA 


TTCATTGAAA 


CTTCGGCCCT 


AGACTCTACA 


AATGTAGAAG 


540 


CTGCTTTTCA 


GACAATTTTA 


ACAGAGATTT 


ACCGCATTGT 


TTCTCAGAAG 


GAAATGTCAG 


600 


ACAGACGCGA 


AAATGACATG 


TCTCCAAGCA 


ACAATGTGGT 


TCCTATTCAT 


GTTCCACCAA 


660 


CCACTGAAAA 


CAAGCCAAAG 


GTGCAGTGCT 


GTCAGAACAT 


CTAAGGCATT 


TCTCTTCTCC 


720 


CCTAGAAGGC 


TGTGTATAGT 


CCATTTCCCA 


GGTGTSASAT 


TTAAATATAW 


TTGTAATTCT 


760 


TGTGTCACTT 


TTGTGTTTTA 


TTACTTCATA 


CTTATGAATT 


TTTCCATGTC 


CTAAGTCTTT 


640 


TGATTTTGMT 


TTATAAAATC 


ATCCACTTGT 


NCCGAATGNC 


TGCAGCTTTT 


TTTCATGCTA 


900 


TGGCTTCACT 


AGCCTTAGTT 


TNATAAACTG 


AATGTTTGGA 


TTCCTCCCCC 


CAAAAAAAAA 


960 


AAAACTCGAG 












970 


(2) INFORMATION FOR SEQ ID N0:8: 











(i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: inisc feature 

( B) LOCATION : 1 . . 2^4 

(D) OTHER INFORMATION: /note- •*Y2H23b" 
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(xi) SEQUENCE DESCRIPTION: SEQ Id'^N0:8: 
GAATTCGCGG CCGNGTCGAC CCCCCACCCC CGATGCCACC ACCCCCANTG GGNTCTCCCN 
NCCCAGTCAT CAGTTCTTCC ATGGNGTNCC CTGGTCTGCC CCCTCCAGCT CCCCCAGGCN 
TTCTCCGGGT CTGNCAGCAG CCNCCAGATT AACTCAACAG TGTCACTCCC TGGGGGTGGG 
TCTGGNCCCC CTGANGATGT GAAGCCACCA GTCTNAGNGG TCCGGGGTCT GTACTGTCCA 
CCCCCTCCAG GTGGACCTGG CGCT 
C2) INFORMATION .FOR SEQ ID NO : 9 : 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 
JB) TYPE: nucleic l?iS^^" 

(C) STRANDEDNESS : sinole 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

!n! i'iSF''*^^^* "^isc feature 

(B) LOCATION: I . . 3T0 

(D) OTHER INFORMATION: /note- -y2H27- 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GAATTCGCGG CCGCGTCGAC CGCGGTCGCG TCGACCTGTT GCCCAGGCCC TAGAGGTCAT 
TCCTCGTACC CTGATCCAGA ACTGTGGGGC CAGCACCATC CGTCTACTTA CCTCCCTTCG 
GGCCAAGCAC ACCCAGGAGA ACTGTGAGAC CTGGGGTGTA AATGGTGAGA CGGGTACTTT 
GGTGGACATG AAGGAACTGG GCATATGGGA GCCATTGGCT GTGAAGCTGC AGACTTATAA 
GACAGCAGTG GAGACGGCAG TTCTGCTACT GCGAATTGAT GACATCGTTT CAGGCCACAA 
AAAGAAAGGC GATGACCAGA GCCGGCAAGG CGGNGCTCCT 
(2) INFORMATION FOR SEQ ID N0:10: 
(i) SEQUENCE CHARACTERISTICS' 

Ib! T??r^"-' ^?^^^se pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 4D4 

(D) OTHER INFORMATION: /note- •'Y2H35- 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10- 
GAATTCGCGG TCGCGTCGAC GGTTAGTCCC A.CTGGNCGC A ^ TCGAGGGNTT CACCAACGTC 
ATGGAGCTGT ATGGCANGAT CGCCGAGGTC TTCCNCCTGC CAACTGCCGA GGTGATGTTC 
TGCACCCTGA NCACCCACAA AGTGGACATN GACAAGCTCC TGGGGGGCCA GATCGGGCTG 
GAGGACTTCA TCTTCGCCCA CGTGAAGGGG YAGCGCAAGG AGGTGGAGGT GTTCAWGTCG 
GAGGATGYAC TCGGKCTCAC CATCACGGAC AACGGGGCTG GCTACGCTTC CATCAAGCGC 
ATCAAGGAGG GCAGCGTGAT CGACCACATC CACCTCATCA GCGTGGGCGA CATGATCGAG 
GCCATTAACG GGCAGAGCTT CCTGGGCTGC CGGCATTACG AGGT 
(2) INFORMATION FOR SEQ ID N0:11: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 350 base pairs 
B) TYPE: nucleic acid 
C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1..350 

(D) OTHER INFORMATION: /note= '*Y2H17l- 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11- 
GAATTCGCGG CCGCGTCGAC AAAAAAAGTA AAAGG AACTC ' GGC AA ATCTT ACCCCGCCTG 
TTTACCAAAA ACATCACCTC TAGCATCACC AGTATTAGAG GCACCGCCTG CCCAGTGACA 



60 
120 
180 
240 
264 



60 
120 
180 
240 
300 
340 



.60 
120 
180 
240 
300 
360 
404 



60 
120 
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CATGTTTAAC GGCCGCGGTA CCCTAACCGT GCAAAGGTAG CATAATCACT 



TGTTCCTTAA 



GTAGGGACCT GTATGAATGG CTCCACGAGG GTTCAGCTGT CTCTTACTTT 



TAACCARTGA 



240 



AATTGACCTG CCCGTGAAGA GGCGGGCATG ACACAGCAAG ACGAGAAGAC 



CCTATGGAGC- 



300 



TTTAATTTAT TAATGCAAAC AGTACCTAAC AAACCCACAG GGTCCTAAAC 



• 350 



(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs 

(B) TYPE: nucleic acid 
{CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 1 . . 350 

(D) OTHER INFORMATION: /note- "Y2H41- 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12; 



GAATTCGCGG 


NCGCGTCGAC 


AGATAATGAA 


AAAACCAGAG 


GTTCCCTTCT 


TTGGTCCCCT 


60 


NNNNGATGGT 


GCTATTGTGA 


ATGGAAAGGT 


TCTACCCATT 


ATGGTTAGAG 


CAACAGCTAT 


120 


AAATGCAAGC 


CGTGCTCTGA 


AATCTCTGAT 


TCCATTGTAT 


CAAAACTTCT 


ATGAGGAGAG 


180 


AGCACGATAC 


CTGCAAACAA 


TTGTCCAGCA 


CCACTTAGAA 


CCAACAACAT 


TTGAAGATTT 


240 


TGNAGCACAG 


GTTTTTTCTC 


CAGCTCCCTA 


CCACCATTTA 


CCATCTGATG 


CCGTTGGCTC 


300 


CTACCCAGAG 


ATTCTACCCA 


GTGAAAACTC 


CCACAGCAAC 


GCAGGTAGGA 




350 
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CLAIMS 

What is claimed is: 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding at 

least a presenilin-interacting domain of a presenilin-interacting protein selected from 
the group consisting of a mammalian S5a (approximately residues 70-377 of SEQ ID 
NO: 2), GT24 (approximately residues 346-862 of SEQ ID NO: 4), p0071 
(approximately residues 509-1022 of SEQ ID NO: 6). Rabl 1 (SEQ ID NO: 7). 
retinoid X receptor-p (SEQ ID N0:8), cytoplasmic chaperonin (SEQ ID NO: 9), 
Y2H35(SEQIDNO: 10), Y2H171 (SEQ ID NO: 11), and a Y2H41 (SEQ ID NO: 
12) presenilin-interacting domain. 



10 2. An isolated nucleic acid comprising a nucleotide sequence of at least 1 0 

consecutive nucleotides selected from the group consisting of SEQ ID NO: 1 , SEQ ED 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 
10, SEQ ID NO: II. SEQ ID NO: 12, GenBank Accession Numbers F08730, T18858, 
X81889, X56740, X53143, M84820. X63522, M81766, U17104, X74801, R12984. 
D55326, and T64843, and a sequence complementary to any of these sequences. 



15 



.3. ^isolated nucleic acid as in claim 2 comprising a nucleotide sequence of 

at least 1 5 consecutive nucleotides selected from said group. 

20 4. An isolated nucleic acid as in claim 2 comprising a nucleotide sequence of 

at least 20 consecutive nucleotides selected from said group. 
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5. An isolated nucleic acid comprising a nucleotide sequence encoding an 
antigenic detenninant of a presenilin-interacting protein selected from the group 
consisting of a manmialian S5a, GT24, p0071 , Rab 11 , retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171 , and Y2H41 protein. 

5 

6. A method for identifying allelic variants or heterospecific homologues of a 
human presenilin-interacting protein gene comprising 

choosing a nucleic acid probe or primer capable of hybridizing to a human 
presenilin-interacting protein gene sequence under stringent hybridization conditions; 
10 mixing said probe or primer with a sample of nucleic acids which may 

contain a nucleic acid corresponding to said variant or homologue; 

detecting hybridization of said probe or primer to said nucleic acid 
corresponding to said variant or homologue. 

15 7. A method as in claim 6 wherein said sample comprises a sample of nucleic 

acids selected from the group consisting of human genomic DNA, human mRNA, and 
human cDNA. 

8. A method as in claim 6 wherein said sample comprises a sample of nucleic 
20 acids selected from the group consisting of mammalian genomic DNA, mammalian 

mRNA, and mammalian cDNA. 

9. A method as in claim 6 wherein said sample comprises a sample of nucleic 
acids selected from the group consisting of invertebrate genomic DNA, invertebrate 

25 mRNA, and invertebrate cDNA. 
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1 0. A method as in claim 6 further comprising the step of isolating said nucleic 

acid corresponding to said variant or homologue. 



5 11. A method as in claim 6 wherein said nucleic acid is identified by 

hybridization. 



10 



12. A method as in claim 6 wherein said nucleic acid is identified by PGR 

amplification. 



13. A method for identifying allelic variants or heterospecific homologues of a 

human preseniiin-interacting protein gene comprising 

choosing an antibody capable of selectively binding to a human 
preseniiin-interacting protein; 
^5 mixing said antibody with a sample of proteins which may contain a 

protein corresponding to said variant or homologue; 

detecting binding of said antibody to said protein corresponding to said 
variant or homologue. 



20 14. A method as in claim 13 wherein said sample comprises a sample of 

proteins selected from the group consisting of human proteins, human fusion proteins, 
and proteolytic fragments thereof 
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15. A method as in claim 1 3 wherein said sample comprises a sample of 

proteins selected from the group consisting of mammalian proteins, mammalian 
fusion proteins, and proteolytic fragments thereof. 



1 6. A method as in claim 13 wherein said sample comprises a sample of 

proteins selected from the group consisting of invertebrate proteins, invertebrate 
fusion proteins, and proteolytic fragments thereof. 



17. A method as in claim 1 3 further comprising the step of substantially 
10 purifying said protein corresponding to said variant or homologue. 

18. An isolated nucleic acid comprising an allelic variant or a heterospecific 
homologue of a human presenilin-interacting protein gene. 

15 19. An isolated nucleic acid encoding an allelic variant or heterospecific 

homologue of a human presenilin-interacting protein. 

20. An isolated nucleic acid comprising a recombinant vector including a 
nucleotide sequence of any one of claims 1-19. 

20 

21 . An isolated nucleic acid as in claim 20 wherein said vector is an 
expression vector and said presenilin-interacting protein nucleotide sequence is 
operably joined to a regulatory region. 
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22. An isolated nucleic acid as in claim 2 1 wherein said expression vector 

express said presenilin-lnteracting protein sequence in mammalian cells. 



23. An isolated nucleic acid as in claim 22 wherein said cells are selected from 

the group consisting of fibroblast, liver, kidney, spleen, bone manxjw and neurological 



cells. 



24. An isolated nucleic acid as in claim 2 1 wherein said vector is selected from 

10 the group consisting of vaccinia virus, adenovims, retrovirus, neurotropic viruses and 
Herpes simplex. 



25 . An isolated nucleic acid as in claim 2 1 wherein said expression vector 

encodes at least a presenilin-interacting domain of a presenilin-interacting protein 
selected from the group consisting of a mammalian S5a, GT24, p0071, Rabl 1, 
retinoid X receptor-p, cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 



26. An isolated nucleic acid as in claim 2 1 wherein said vector further 

comprises sequences encoding an exogenous protein operably joined to said 
presenilin-interacting protein sequence and whereby said vector encodes a presenili 
interacting protein fusion protein. 



27. An isolated nucleic acid as in claim 26 wherein said exogenous protein is 

selected from th£ group consisting of lacZ, trpE, maltose-binding protein, a poly-His 
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tag, glutathione-S-transferase, a GAL4-DNA binding domain, and a GAL4 activation 
domain. 

28. An isolated nucleic acid comprising a recombinant expression vector 

5 including nucleotide sequences corresponding to an endogenous regulatory region of a 
presenilin-interacting protein gene. 

29. An isolated nucleic acid as in claim 28 wherein said endogenous 
regulatory region is operably joined to a marker gene. 

10 

30. A host cell transformed with an expression vector of any one of claims 20- 
29, or a descendant thereof. 

31. A host cell as in claim 30 wherein said host cell is selected from the group 
15 consisting of bacterial cells and yeast cells. 

32. A host cell as in claim 30 wherein said host cell is selected from the group 
consisting of fetal cells, embryonic stem cells, zygotes, gametes, and germ line cells. 

20 33. A host cell as in claim 30 wherein said cell is selected from the group 

consisting of fibroblast, liver, kidney, spleen, bone marrow and neurological cells. 
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34. A host cell as in claim 30 wherein said cell is an invertebrate cell. 



35. A non-human animal model for Alzheimer's Disease, wherein a genome of 

said animal, or an ancestor thereof, has been modified by at least one recombinant 
construct, and wherein said recombinant construct has introduced a modification 
selected from the group consisting of (1) insertion of nucleotide sequences encoding 
at least a functional domain of a heterospecific normal presenilin-interacting protein. 
(2) insertion of nucleotide sequences encoding at least a functional domain of a 
heterospecific mutant presenilin-interacting protein, (3) insertion of nucleotide 
sequences encoding at least a functional domain of a conspecific homologue of j 
heterospecific mutant presenilin-interacting protein, and (4) inactivation of s 
endogenous presenilin-interacting protein gene. 



; a 

an 



36. An animal as in claim 35 wherein said modification is insertion of a 

nucleotide sequence encoding at least a functional domain of a normal human 
presenilin-interacting protein selected from the group consisting of a mammalian S5a, 
GT24, p0071, Rabl 1, retinoid X receptor-p, cytoplasmic chaperonin. Y2H35, 
Y2H1 71, and Y2H41 protein. 



37. An animal as in claim 35 wherein said modification is insertion of a 

nucleotide sequence encoding at least a functional domain of a mutant human 
presenilin-interacting protein selected from the group consisting of a mammalian S5a, 
GT24, p0071, Rabl 1, retinoid X receptor-p, cytoplasmic chaperonin, Y2H35, 
Y2H1 71, and Y2H41 protein. 
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38. An animal as in claim 35 wherein said animal is selected from the group 

consisting of rats, mice, hamsters, guinea pigs, rabbits, dogs, cats, goats, sheep, pigs, 
and non-human primates. 



5 39. An animal as in claim 35 wherein said animal is an invertebrate. 

40, A method for producing at least a functional domain of a presenilin- 
interacting protein comprising culturing a host cell of any of claims 30-34 under 
suitable conditions to produce said presenilin by expressing said nucleic acid. 

10 

41 . A substantially pure preparation of a protein selected from the group 
consisting of a mammalian S5a, GT24. p0071, Rabl 1, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H41 protein. 

15 42. A substantially pure preparation of a polypeptide comprising an amino 

acid sequfence of at least 10 consecutive amino acid residues selected from the group 
consisting SEQ ED NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, and GenBank Accession 
Numbers F08730, T18858, X81889, X56740, X53143, M84820, X63522, M81766, 
UI7104, X74801, R12984, D55326, and T64843. 

20 . 

43. A substantially pure preparation of a polypeptide as in claim 42 

comprising an amino acid sequence of at least 15 consecutive amino acid residues 
selected from said group. 
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44. A substantially pure preparation of a polypeptide comprising at least a 

presenilin-interacting domain of a presenilin-interacting protein selected from the 
group consisting of a mammalian S5a, GT24, p0071, Rabl I, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35. Y2H171, and Y2H41 protein. 



45. A substantially pure preparation of a polypeptide comprising an antigenic 

determinant of a presenilin-imeracting protein selected from the group consisting of a 



mammalian S5a, GT24, p0071, Rabl 1, retinoid X receptor-p, cytoplasi 
10 chaperonin, Y2H35, Y2H171, and Y2H41 protein. 



mic 



46. A method of producing antibodies which selectively bind to a presenilin- 

interacting protein comprising the steps of 

administering an immunogenically effective amount of a presenilin- 
interacting protein inomunogen to an animal; 

allowing said animal to produce antibodies to said immuriogen; and 
obtaining said antibodies from said animal or torn a cell cuinire derived 

therefrom. 



20 47. A substantially pure preparation of an antibody which selectively binds to 

an antigenic determinant of a presenilin-interacting protein selected from the group 
consisting of a mammalian S5a, GT24. p0071. Rabll, retinoid X receptor-p, 
cytoplasmic chaperonin, Y2H35, Y2H171, and Y2H4I protein. 



SUBSTITUTE SHEET (RULE 26) 



wo 97/27296 



PCT/CA97/00051 



-122- 

48. A substantially pure preparation of an antibody as in claim 47 wherein said 

antibody selectively binds to an antigenic detenminant of a mutant presenilin- 
interacting protein and fails to bind to a normal presenilin-interacting protein. 

5 49. A cell line producing an antibody of any one of claims 47-48. 

50. A method for identifying compounds which can modulate the expression 

of a presenilin-interacting protein gene comprising 

contacting a cell with a test candidate wherein said cell includes a 
10 regulatory region of a presenilin-interacting protein gene operably joined to a coding 
region; and 

detecting a change in expression of said coding region. 



51. A method as in claim 50 wherein said change comprises a change in a 
15 level of an mRNA transcript encoded by said coding region. 

52. A method as in claim 50 wherein said change comprises a change in a 
level of a protein encoded by said coding region. 

20 53. A method as in claim 50 wherein said change is a result of an activity of a 

protein encoded by said coding region. 
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54. A method as in claim 50 wherein said coding region encodes a marker 

protein selected from the group consisting of P-galactosidase, alkaline phosphatase, 
green fluorescent protein, and luciferase. 



55. A method for identifying compounds which can selectively bind to a 

presenilin-interacting protein comprising the steps of 

providing a preparation including at least one presenilin-interacting protein 
component; 

contacting said preparation with a sample including at least one candidate 
compound; and 

detecting binding of said presenilin-interacting protein component to said 
candidate compound. 



56. The method in 55 wherein said binding to said presenilin-interacting 

component is detected by an assay selected from the group consisting of: affinity 
chromatography, co-immunoprecipitation, a Biomolecular Interaction Assay, and a 
yeast two-hybrid system. 



57. A method ofidentifying compounds which can modulate activity of a 

presenilin-interacting protein comprising the steps of 

providing a cell expressing a normal or mutant presenilin-interacting 
protein gene; 

contacting said cell with at least one candidate compound; and 
detecting a change in a marker of said activity. 
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58. A method as in claim 57 wherein measurement of said marker indicates a 
difference between cells bearing an expressed mutant presenilin-interacting protein 
gene and otherwise identical cells free of an expressed mutant presenilin-interacting 
protein gene. 

5 

59. A method as in claim 57 wherein said change comprises a change in a non- 
specific marker of cell physiology selected from the group consisting of pH; 
intracellular Ca"*, Na*, or K"; cyclic AMP levels; GTP/GDP ratios; 
phosphatidylinositol activity; and protein phosphorylation. 

10 

60. A method as in claim 57 wherein said change comprises a change in 
expression of said presenilin-interacting protein. 

61 . A method as in claim 57 wherein said change comprises a change in 

15 intracellular concentration or flux of an ion selected from the group consisting of Ca^*, 
Na* and Yi\ 

62. A method as in claim 57 wherein said change comprises a change in 
occurrence or rate of apoptosis or cell death. 

20 

63. A method as in claim 57 wherein said change comprises a change in 
production of Ap peptides. 
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64. 



A method as in claim 57 wherein said change comprises a change in 



phosphorylation of at least one microtubule associated protein. 

65. A method as in claim 57 wherein said cell is a cell cultured in vitro . 

5 

66. A method as in claim 65 wherein said cell is a transformed host cell of any 
one of claims 30-34. 



67. A method as in claim 65 wherein said cell is explanted from a host bearing 

10 at least one mutant presenilin-interacting protein gene. 



68. A method as in claim 65 wherein said cell is explanted from a transgenic 

animal of any one of claims 35-39. 



A method as in claim 57 wherein said cell is a cell in a live animal. 



70. A mediod as in claim 69 wherein said cell is a cell of a transgenic animal 

of any one of claims 35-39. 



71. A method as in claim 57 wherein said cell is in a human subject i 

clinical trial. 
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72. A diagnostic method for determining if a subject bears a mutant presenilin- 
interacting protein gene comprising the steps of 

providing a biological sample of said subject; 
5 detecting in said sample a mutant presenilin-interacting protein nucleic 

acid, a mutant presenilin-interacting protein, or a mutant presenilin-interacting protein 
activity. 

73. A method as in claim 72, wherein a mutant presenilin-interacting protein 
10 nucleic acid is detected by an assay selected from the group consisting of direct 

nucleotide sequencing, probe specific hybridization, restriction enzyme digest and 
mapping, PGR mapping, ligase-mediated PGR detection, RNase protection, 
electrophoretic mobility shift detection, and chemical mismatch cleavage. 

15 74, A method as in claim 72, wherein a mutant presenilin-interacting protein is 

detected by an assay selected from the group consisting of an immunoassay, a 
protease assay, and an electrophoretic mobility assay. 

75, A pharmaceutical preparation comprising a substantially pure presenilin- 
20 interacting protein and a phamiaceutically acceptable carrier. 

76. A pharmaceutical preparation comprising an expression vector operably 
encoding a presenilin-interacting protein, wherein said expression vector may express 
said presenilin-interacting protein in a hiunan subject, and a pharmaceutical ly 

25 acceptable carrier. 
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10 



20 



77. A phannaceutical preparation comprising an expression vector operably 

encoding a presenilin-interacting protein antisense sequence, wherein said expression 
vector may express said presenilin-interacting protein antisense sequence in a human 
subject/and a phannaceutically acceptable carrier. 



78. A pharmaceutical preparation comprising a substantially pure antibody, 

wherein said antibody selectively binds to a mutant presenilin-interacting protein, and 
a phannaceutically acceptable carrier. 



is 



79. A phannaceutical preparation as in claim 78 wherein said preparation ] 

essentially free of an antibody which selectively binds a normal presenilin-interacting 
protein. 



15 80. A phannaceutical preparation comprising a substantially pure preparation 

of an antigenic detenninant of a mutant presenilin-interacting protein. 



81. A phannaceutical preparation as in claim 80 wherein said preparation is 

essentially free of an antigenic detenninant of a nonnal presenilin-interacting protein. 



82. A method of treatment for a patient bearing a mutant presenilin-interacting 

protein gene comprising the step of administering to said patient a therapeutically 
effective amount of the pharmaceuucal preparation of any one of claims 75-81. 
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83. A method as in claim 82, wherein said phannaceutical preparation is 

targeted to a cell type is selected from the group consisting of heart, brain, lung, liver, 
skeletal muscle, kidney, pancreas and neurological cells. 
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